Compositions and methods utilizing the yeast ZEO1 promoter

ABSTRACT

The invention provides novel yeast promoters useful for controlling the expression of homologous and heterologous nucleic acid molecules in yeast cells. The yeast promoters are induced by a fermentable carbon source, such as glucose, or a non-fermentable carbon source, such as ethanol, or both. Therefore, expression of nucleic acid molecules encoding a polypeptide under the control of the novel yeast promoters may be regulated by varying the level of a fermentable carbon source, or a non-fermentable carbon source, or both.

BACKGROUND OF THE INVENTION

The controlled production in yeast of an enormous variety of useful proteins or polypeptides can be achieved using recombinant DNA technology. Yeast cells can be transformed with yeast expression vectors, which contain homologous or heterologous nucleic acid molecules encoding polypeptides (coding sequences). The yeast cells can then produce large quantities of the useful proteins or polypeptides in yeast cell culture.

Expression of the nucleic acid molecule encoding a polypeptide by the yeast expression vector is initiated at a region known as the promoter, which is recognized by and bound by RNA polymerase. The RNA polymerase travels along the DNA, transcribing the information contained in the coding strand from its 5′ to 3′ end into messenger RNA, which is in turn translated into a polypeptide having the amino acid sequence for which the DNA codes. The present invention provides novel yeast promoters useful for, inter alia, controlling the expression of homologous and heterologous nucleic acid sequences encoding proteins and polypeptides in yeast cells.

SUMMARY OF THE INVENTION

It is an object of the invention to provide novel yeast promoters, yeast expression vectors, and transformed yeast cells. It is a further object of the invention to provide a method for producing proteins and polypeptides in yeast cell culture.

In one embodiment of the invention a yeast promoter which comprises at least 17 contiguous nucleotides of an isolated and purified polynucleotide is provided. The promoter sequences are shown in SEQ ID NO:1, SEQ ID NO:2, SEQ ID NO:3, and SEQ ID NO:4. The promoter is operative when operably linked to a nucleic acid molecule encoding a polypeptide.

As used herein, the term “promoter” refers to a nucleic acid sequence which is capable of initiating transcription of a nucleic acid molecule encoding a polypeptide (coding sequence); a “yeast promoter” is capable of initiating transcription of a coding sequence in yeast cells; and “promoter activity” refers to the level or amount of transcription initiation of a coding sequence, and encompasses any level above background (i.e., the level or amount that occurs in the absence of a promoter; a background level, which is normally zero).

Another embodiment of the invention provides a yeast promoter which comprises an isolated and purified polynucleotide. The promoter sequences are shown in SEQ ID NO:1, SEQ ID NO:2, SEQ ID NO:3, and SEQ ID NO:4. The promoter is operative when operably linked to a nucleic acid molecule encoding a polypeptide.

Yet another embodiment of the invention provides a yeast promoter fragment which comprises at least 17 contiguous nucleotides of a polynucleotide. The polynucleotides are shown in SEQ ID NO:1, SEQ ID NO:2, SEQ ID NO:3, and SEQ ID NO:4. The fragment has promoter activity as determined by cloning the fragment into a yeast expression vector, wherein the fragment is operably linked to a reporter gene, transforming yeast cells with the yeast expression vector, growing the yeast cells in yeast cell culture under conditions favorable for expression of the reporter gene, and assaying the yeast culture for a reporter protein expressed by the reporter gene. The expression of the reporter gene indicates the fragment has promoter activity.

Still another embodiment of the invention provides a yeast expression vector comprising a yeast promoter. The promoter sequences are shown in SEQ ID NO:1, SEQ ID NO:2, SEQ ID NO:3, and SEQ ID NO:4. The promoter is operative when operably linked to a nucleic acid molecule encoding a polypeptide.

A further embodiment of the invention provides a yeast expression vector where activity of the promoter is controlled by varying the level of a non-fermentable carbon source, such as ethanol, in a medium of yeast cells in culture. The yeast cells are transformed with said yeast expression vector.

In yet another embodiment of the invention, a yeast expression vector comprising a yeast promoter which comprises at least 17 contiguous nucleotides of an isolated and purified polynucleotide is provided. The promoter sequences are shown in SEQ ID NO:1, SEQ ID NO:2, and SEQ ID NO:4. Promoter activity is controlled by varying the level of a fermentable carbon source in a medium of yeast cells in culture, where the yeast cells are transformed with the yeast expression vector. The fermentable carbon source can be glucose.

Another embodiment of the invention provides a yeast expression vector comprising a yeast promoter. The yeast promoter comprises at least 17 contiguous nucleotides of an isolated and purified polynucleotide. The promoter sequences are shown in SEQ ID NO:1, SEQ ID NO:2, and SEQ ID NO:4. Promoter activity is controlled by varying the level of a fermentable carbon source and a non-fermentable carbon source, such as ethanol, in a medium of yeast cells in culture, where the yeast cells are transformed with the yeast expression vector. The fermentable carbon source can be glucose. The non-fermentable carbon source can be ethanol.

Still another embodiment of the invention provides a yeast cell transformed with a yeast expression vector. The yeast expression vector comprises a yeast promoter. The promoter sequences are shown in SEQ ID NO:1, SEQ ID NO:2, SEQ ID NO:3, and SEQ ID NO:4. The promoter is operative when operably linked to a nucleic acid molecule encoding a polypeptide.

Yet another embodiment of the invention provides a method for producing a polypeptide. A yeast expression vector is constructed where a polynucleotide encoding the polypeptide is controlled by a yeast promoter. The yeast promoter comprises at least 17 contiguous nucleotides of an isolated and purified polynucleotide. The promoter sequences are shown in SEQ ID NO:1, SEQ ID NO:2, SEQ ID NO:3, and SEQ ID NO:4. The promoter is operative when operably linked to a nucleic acid molecule encoding a polypeptide. A culture of yeast cells is transformed with the yeast expression vector. The yeast cells are maintained in culture so that the polypeptide is expressed. The polypeptide is then recovered.

Still another embodiment of the invention provides a method for producing a polypeptide. A nucleic acid molecule encoding the polypeptide is cloned into an expression vector selected from the group consisting of pYLR110P+luc, pYMR251AP+luc, pYMR107P+luc, pZEO1P+luc, pYLR110P, pYMR251AP, pYMR107P, and pZEO1P. The nucleic acid molecule is operably linked to a promoter of the expression vector. A culture of yeast cells is transformed with the yeast expression vector. The yeast cells are maintained in culture so that the polypeptide is expressed and the polypeptide is then recovered.

Another embodiment of the invention provides a method for producing a polypeptide. A yeast expression vector is constructed where a nucleic acid molecule encoding the polypeptide is controlled by a yeast promoter. The yeast promoter comprises at least 17 contiguous nucleotides of an isolated and purified polynucleotide. The promoter sequences are shown in SEQ ID NO:1, SEQ ID NO:2, and SEQ ID NO:4. Yeast cells are transformed with the yeast expression vector and are maintained in culture medium. The expression of the nucleic acid molecule encoding the polypeptide is controlled by varying the level of a fermentable carbon source, such as glucose, in the culture medium. The polypeptide is then recovered.

Still another embodiment of the invention provides a method for producing a polypeptide. A yeast expression vector is constructed where a nucleic acid molecule encoding the polypeptide is controlled by a yeast promoter. The yeast promoter comprises at least 17 contiguous nucleotides of an isolated and purified polynucleotide. The promoter sequences are shown in SEQ ID NO:1, SEQ ID NO:2, SEQ ID NO:3, and SEQ ID NO:4. The promoter is operative when operably linked to a nucleic acid molecule. A culture of yeast cells is transformed with the yeast expression vector. The yeast cells are maintained in culture medium and the expression of the nucleic acid molecule encoding the polypeptide is controlled by varying the level of a non-fermentable carbon source, such as ethanol, in the culture medium. The polypeptide is then recovered.

Another embodiment of the invention provides a method for producing a polypeptide. A yeast expression vector is constructed where a nucleic acid molecule encoding the polypeptide is controlled by a yeast promoter. The yeast promoter comprises at least 17 contiguous nucleotides of an isolated and purified polynucleotide. The promoter sequences are shown in SEQ ID NO:1, SEQ ID NO:2, and SEQ ID NO:4. A culture of yeast cells is transformed with the yeast expression vector. The yeast cells are maintained in culture medium and the expression of the nucleic acid encoding the polypeptide is controlled by varying the level of a fermentable carbon source, such as glucose, and a non-fermentable carbon source, such as ethanol, in the culture medium. The polypeptide is then recovered.

Yet another embodiment of the invention provides a method of identifying a promoter fragment with promoter activity by generating a fragment comprising at least 17 contiguous nucleotides of an isolated and purified polynucleotide. The polynucleotides are shown in SEQ ID NO:1, SEQ ID NO:2, SEQ ID NO:3, and SEQ ID NO:4. The fragment is cloned into a yeast expression vector, so that the fragment is operably linked to a reporter gene. Yeast cells are transformed with the yeast expression vector and grown in yeast cell culture under conditions favorable for expression of the reporter gene. The yeast culture is assayed for a reporter protein expressed by the reporter gene. Expression of the reporter gene indicates the fragment has promoter activity.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a map of the YEp13 expression vector.

FIG. 2 schematically illustrates construction of YLR110C and YMR251WA promoter constructs.

FIG. 3 is a map of pPRB1P.

FIG. 4 is a map of pPRB1P+luc.

FIG. 5 is a map of pYLR110P+luc.

FIG. 6 is a map of pYMR251AP+luc.

FIG. 7 is a map of pYMR107P+luc.

FIG. 8 is a map of pZEO1P+luc.

FIG. 9 is a map of pYLR110P.

FIG. 10 is a map of pYMR251AP.

FIG. 11 is a map of pYMR107P.

FIG. 12 is a map of pZEO1P.

FIG. 13 schematically illustrates the YLR110C promoter region.

FIG. 14 schematically illustrates the YMR251WA promoter region.

FIG. 15 schematically illustrates the YMR107W promoter region.

FIG. 16 schematically illustrates the ZEO1 promoter region.

DETAILED DESCRIPTION OF THE INVENTION

Novel yeast promoters whose activity can be controlled by a fermentable carbon source, such as glucose, or a non-fermentable carbon source, such as ethanol, or both have been identified. The yeast promoters are useful for, inter alia, the high level production of proteins or polypeptides in yeast cell culture.

Yeast Promoters

The isolated and purified promoter polynucleotides of the invention are shown in SEQ ID NO:1 (the YLR110C promoter), SEQ ID NO:2 (the YMR251WA promoter), SEQ ID NO:3 (the YMR107W promoter), and SEQ ID NO:4 (the ZEO1 promoter). Yeast promoters comprising as little as 17 nucleic acids have been determined to function as promoters. The yeast promoters of the invention comprise at least 17, 25, 50, 75, 100, 150, 200, 250, 300, 350, 400, 450, 500, 600 or 700 contiguous nucleic acids of an isolated and purified polynucleotide up to the maximum length provided in any one of the sequences presented herein, that is, SEQ ID NO:1, SEQ ID NO:2, SEQ ID NO:3, and SEQ ID NO:4.

Preferably, the promoter polynucleotides are isolated free of other components, such as proteins and lipids. The polynucleotides can be made by a cell and isolated or can be synthesized in the laboratory, for example, using an automatic synthesizer or an amplification method such as PCR.

Naturally occurring variants and artificial sequence variants (that is, those which do not occur in nature) of the promoters are included in the invention. Variants of the promoters and/or fragments thereof have, along their entire length, sequence identity of at least 90%, and preferably greater than 95% as determined by the Smith-Waterman homology search algorithm as implemented in the MPsrch™ program (University of Edinburgh) using an affine gap search with the following search parameters: gap open penalty: 12, gap extension penalty: 1.
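The affine gap scoring named above can be illustrated with a minimal Smith-Waterman sketch. This is not the MPsrch™ implementation: the match and mismatch scores are assumptions chosen for illustration (the text gives only the gap penalties), and the convention that a gap of length k costs gap_open + (k - 1) * gap_extend is likewise assumed. The function returns only the best local alignment score, not a percent identity.

```python
def smith_waterman_affine(a, b, match=5, mismatch=-4, gap_open=12, gap_extend=1):
    """Best local alignment score with affine gaps (Gotoh recurrence).

    gap_open and gap_extend follow the values in the text; match and
    mismatch are illustrative assumptions, not taken from the source.
    """
    neg_inf = float("-inf")
    rows, cols = len(a) + 1, len(b) + 1
    # H: best score ending at (i, j); E/F: best score ending in a gap.
    H = [[0] * cols for _ in range(rows)]
    E = [[neg_inf] * cols for _ in range(rows)]
    F = [[neg_inf] * cols for _ in range(rows)]
    best = 0
    for i in range(1, rows):
        for j in range(1, cols):
            # Open a new gap from H, or extend an existing one.
            E[i][j] = max(H[i][j - 1] - gap_open, E[i][j - 1] - gap_extend)
            F[i][j] = max(H[i - 1][j] - gap_open, F[i - 1][j] - gap_extend)
            score = match if a[i - 1] == b[j - 1] else mismatch
            # Local alignment: scores never drop below zero.
            H[i][j] = max(0, H[i - 1][j - 1] + score, E[i][j], F[i][j])
            best = max(best, H[i][j])
    return best


# Placeholder sequences for illustration only; not SEQ ID NO:1-4.
identical = smith_waterman_affine("ACGT" * 5, "ACGT" * 5)
unrelated = smith_waterman_affine("AAAA", "TTTT")
```

With the assumed match score of 5, two identical 20-nucleotide sequences score 20 × 5, while sequences sharing no nucleotides score zero.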

Fragments of the full-length promoters are also functional as promoters. A promoter fragment of at least 17 contiguous nucleotides may occur at any position along the full-length promoter as shown in SEQ ID NO:1, SEQ ID NO:2, SEQ ID NO:3 or SEQ ID NO:4. Accordingly, promoter activity of 17 or more contiguous nucleotides occurring anywhere along the full-length promoter can be analyzed. Fragments of 17, 25, 50, 75, 100, 150, 200, 250, 300, 350, 400, 450, 500, 550, 600, 650 or 700 nucleotides of the promoters may be constructed by, for example, subjecting an isolated promoter to restriction endonucleases, to 5′- or 3′-deletion mutagenesis, to PCR, or to site specific deletion. A combination of these methods can also be used to generate fragments of a promoter.
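As a rough in silico counterpart to the fragment generation described above, the sketch below enumerates every contiguous window of a chosen length along a sequence. The function name and the placeholder sequence are hypothetical; the sequence is not one of SEQ ID NO:1-4, and the code says nothing about which fragments would actually show promoter activity.

```python
MIN_FRAGMENT_LENGTH = 17  # minimum fragment length stated in the text


def contiguous_fragments(promoter, length=MIN_FRAGMENT_LENGTH):
    """Yield (start, fragment) for every window of `length` nucleotides."""
    if length < MIN_FRAGMENT_LENGTH:
        raise ValueError("fragments shorter than 17 nucleotides are out of scope")
    for start in range(len(promoter) - length + 1):
        yield start, promoter[start:start + length]


# Placeholder 40-nucleotide sequence for illustration only.
example = "ACGT" * 10
fragments = list(contiguous_fragments(example))
print(len(fragments))  # 40 - 17 + 1 = 24 windows
```

A sequence of N nucleotides contains N - L + 1 windows of length L, so the number of candidate fragments to assay grows linearly with promoter length for any fixed fragment size.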

The invention further embodies a hybrid promoter, i.e., a promoter that comprises more than one promoter or more than one fragment of a promoter from which it was derived. The promoter fragments can be derived from more than one of the promoter sequences shown in SEQ ID NO:1, SEQ ID NO:2, SEQ ID NO:3 and SEQ ID NO:4. The promoters and fragments can be constructed as described above, ligated together, and cloned into a yeast expression vector. Where a promoter comprises nucleotides from at least two polynucleotides selected from the group consisting of SEQ ID NO:1, SEQ ID NO:2, SEQ ID NO:3 and SEQ ID NO:4, at least 5, 6, 7, 8, 9, 10, 25, 50, 75, 100, 150, 200, 250, 300, 350, 400, 450, 500, 550, 600, or 650 contiguous nucleotides are derived from each of the polynucleotides to form a promoter of at least 17 nucleotides. Alternatively, each of the full-length promoters can be combined with another full-length promoter or with fragments of other promoters.

The yeast promoters, fragments of the promoters, and hybrid promoters are useful for controlling expression of a protein or polypeptide when the yeast promoter is operably linked to a nucleic acid molecule encoding the protein or polypeptide.

Determination of Promoter Activity

Promoters and fragments of promoters can be assayed for promoter activity by cloning a fragment of a promoter, or a full-length promoter, or a hybrid promoter into a yeast expression vector so that it is operably linked to a reporter gene, i.e., a coding sequence for a reporter protein. The yeast expression vector is transformed into yeast cells, which are grown in yeast cell culture, under conditions favorable for expression of the reporter gene, for example, under conditions providing a fermentable and/or non-fermentable carbon source. Expression of the reporter gene, as determined by an assay for the amount of a reporter protein expressed by the reporter gene, indicates that the promoter has activity.

For example, to determine if a promoter has activity, i.e., is operative, expression of a reporter gene by a promoter of the invention may be compared to expression of the reporter gene by a reference promoter such as PRB1 (Cottingham et al. (1991) Eur J Biochem 196(2):431-8; Sleep et al. (1991) Biotechnology 9(2):183-7; Finnis et al. (1992) Yeast 8(1):57-60; Meldgaard et al. (1995) Glycoconj J 12(3):380-90; Bach et al. (1996) Receptors and Channels 4(2):129-39). A promoter, a fragment of a promoter, or a hybrid promoter of the invention is operative if it expresses at least 25% of the amount of reporter protein expressed by the full-length PRB1 promoter in a medium containing a non-fermentable carbon source, or a fermentable carbon source, or both. Preferably, an operative promoter expresses at least 50%, 75%, 100%, 200%, 300%, 400%, or more of the amount of reporter protein expressed by the full-length PRB1 reference promoter.
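The 25% criterion can be written as a simple ratio test. The sketch below is one possible reading: the function names are hypothetical, the reporter signals are arbitrary units, and the optional background subtraction is an assumption that the text does not specify.

```python
def relative_activity(test_signal, reference_signal, background=0.0):
    """Reporter signal of a test promoter as a fraction of the reference promoter.

    Background subtraction is an assumption; the text specifies only the ratio.
    """
    reference_net = reference_signal - background
    if reference_net <= 0:
        raise ValueError("reference signal must exceed background")
    return (test_signal - background) / reference_net


def is_operative(test_signal, reference_signal, background=0.0, threshold=0.25):
    """True if the test promoter reaches the stated fraction of the reference."""
    return relative_activity(test_signal, reference_signal, background) >= threshold


# Hypothetical luciferase readings in arbitrary units, not measured data.
print(is_operative(30.0, 100.0))  # 30% of reference
print(is_operative(20.0, 100.0))  # 20% of reference
```

Raising `threshold` to 0.5, 0.75, 1.0 and so on gives the preferred activity levels listed above.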

Assays for promoter activity are useful for identifying yeast promoters with high activity and the specific nucleotide sequences of the promoters that are necessary for promoter activity.

Yeast Expression Vectors

The yeast promoters of the invention, which comprise isolated and purified polynucleotides selected from the group consisting of SEQ ID NO:1, SEQ ID NO:2, SEQ ID NO:3, and SEQ ID NO:4 or fragments thereof, can be used to construct yeast expression vectors.

Yeast expression vectors are any vectors capable of autonomous replication within a yeast host organism or capable of integrating into the yeast genome. Yeast expression vectors are useful for introducing foreign DNA into yeast cells. Typical yeast expression vectors include yeast integrative plasmids (YIp), yeast replicating plasmids (YRp), yeast expression plasmids (YXp), yeast centromere-containing plasmids (YCp), and yeast episomal plasmids (YEp). Preferably, a yeast expression vector can be selected and maintained in both yeast and E. coli.

Yeast expression vectors, typically plasmids, incorporate the yeast promoters of the invention to control expression of nucleic acid molecules encoding heterologous or homologous proteins or polypeptides. The nucleic acid molecules are operably linked to a promoter in the yeast expression vector. A wide range of heterologous eukaryotic and prokaryotic proteins or peptides may be expressed by the vectors of the invention.

Expression vectors incorporating the promoters can be constructed by inserting into a vector a nucleic acid molecule encoding a protein or polypeptide (coding sequence) which is to be expressed. The coding sequence can be inserted at a restriction site which is provided downstream of a translation start codon controlled by the promoter. The coding sequence must be inserted in the correct translational reading frame.

Alternatively, the polynucleotide can itself be provided with a translational start codon followed directly by a coding sequence. Where the promoter does not contain a translational start codon, a restriction site is provided so that the coding sequence can be inserted in the correct reading frame and so that its translational start codon is correctly positioned in relation to the promoter. The coding sequence can encode heterologous or homologous or eukaryotic or prokaryotic polypeptides or proteins. In a preferred embodiment the coding sequence encodes a fusion protein. The coding sequence may further comprise a signal sequence.

In addition to the promoters of the invention, other components can be added to the expression vectors of the invention. For example, yeast selective markers, such as LEU2 or TRP1, which allow for selection of yeast cells that have been effectively transformed by the vector can be added. A yeast replication origin, such as the replication origin of the 2-micron plasmid or the autonomous ARS replication segment can be added. Upstream activating sequences and transcription terminator sequences may be added. Further, at least a portion of a bacterial plasmid, such as found in YEp13, can be added to enable the yeast expression vector to be manipulated in an intermediate bacterial host system, such as Escherichia coli.

The expression vector may also comprise a reporter gene which encodes, for example, β-galactosidase or luciferase. The reporter gene can be under the control of a promoter of the invention. Where the reporter gene, i.e., coding sequence, is linked to a gene encoding a desired protein, assaying the level of expression of the reporter protein can quickly and easily determine the level of expression of the desired protein.

The expression vectors of the invention can be used to direct the fermentable carbon source- and/or non-fermentable carbon source-induced high level expression of proteins or polypeptides in yeast. The promoters of the invention can be induced by the presence of a fermentable carbon source, such as glucose, or a non-fermentable carbon source, such as ethanol, or both. That is, the promoters have greater promoter activity in the presence of a fermentable carbon source, or a non-fermentable carbon source, or both than in the absence of a fermentable carbon source, or a non-fermentable carbon source, or both. Promoters YLR110C, as shown in SEQ ID NO:1; YMR251WA, as shown in SEQ ID NO:2; and ZEO1, as shown in SEQ ID NO:4, can be induced by a fermentable carbon source, such as glucose, or by a non-fermentable carbon source, such as ethanol, or by both. Promoter YMR107W, as shown in SEQ ID NO:3, can be induced by a non-fermentable carbon source, such as ethanol. Thus, the amount of expression of a homologous or heterologous nucleic acid molecule encoding a protein operably linked to the promoters of the invention can be controlled by varying the amount of an available fermentable carbon source, such as glucose, or a non-fermentable carbon source, such as ethanol, or both.

Transformed Yeast Cells

Yeast cells can be transformed with the yeast expression vectors of the invention. Transformation can be accomplished by well known methods, including, but not limited to, electroporation, calcium phosphate precipitation, and microinjection. The yeast expression vectors of the invention can be used to transform yeast cells, including, but not limited to, Saccharomyces cerevisiae, S. uvarum, S. carlsbergensis, Saccharomycopsis lipolytica, Schizosaccharomyces pombe, and Kluyveromyces lactis.

Transformed yeast cells containing a yeast expression vector can be grown in an appropriate medium for the yeast. A fermentable or non-fermentable carbon source can be added to the yeast culture medium in order to control the activity of the promoter.

Methods of Production of Proteins

Yeast cells transformed with expression vectors comprising a promoter of the invention can be used to produce proteins and polypeptides. Under proper cell culture conditions, preferably in the presence of a fermentable or non-fermentable carbon source, or both, the promoters of the invention will control expression of a nucleic acid molecule encoding a polypeptide operably linked to the promoter.

The protein or polypeptide can be retained within the yeast cell. The yeast cells can be then harvested, lysed, and the protein obtained and substantially purified in accordance with conventional techniques. Such techniques include, but are not limited to, chromatography, electrophoresis, extraction, and density gradient centrifugation.

In a preferred embodiment of the invention, the protein or polypeptide to be recovered will further comprise a signal peptide capable of transporting the protein or polypeptide through the membrane of a transformed yeast cell. The protein or polypeptide can be recovered from the culture medium by, for example, adsorption or precipitation.

Further, the proteins and polypeptides may be produced as a fusion protein, which includes not only the amino acid sequence of the desired protein, but also one or more additional proteins. Affinity purification protocols can be used to facilitate the isolation of fusion proteins. Typically, a ligand capable of binding with high specificity to an affinity matrix is chosen as the fusion partner for the desired protein. For example, fusion proteins made with glutathione-S-transferase can be selectively recovered on glutathione-agarose and IgG-Sepharose can be used to affinity purify fusion proteins containing staphylococcal protein A.

Preferably, the protein or polypeptide of interest can be separated from the remainder of the fusion protein. The fusion protein can be constructed so that a site for proteolytic or chemical cleavage is inserted between the protein of interest and the fusion partner. For example, sites for cleavage by collagenase, Factor Xa protease, thrombin, and enterokinase have been inserted between the fusion partner and the protein of interest. The protein of interest can also be cleaved from the remainder of the fusion protein by chemical cleavage by, for example, hydroxylamine, cyanogen bromide (CNBr), or N-chlorosuccinimide.

The following are provided for exemplification purposes only and are not intended to limit the scope of the invention described in broad terms above. All references cited in this disclosure are incorporated by reference.

EXAMPLE 1 Preparation of Yeast Samples

S. cerevisiae strain 11C.

This example describes the growth of haploid Saccharomyces cerevisiae strain 11C. It has the genotype: ade2-161, trp1-Δ63, ura3-52, lys2-801, leu2Δ1 and/or leu2-112, his3Δ200 and/or his4-519. 11C was generated by crossing the strains YPH500 (Mat a ura3-52 lys2-801 ade2-161 trp1-Δ63 his3Δ200 leu2Δ1) (Sikorski and Hieter (1989) A system of shuttle vectors and yeast host strains designed for efficient manipulation of DNA in Saccharomyces cerevisiae. Genetics 122: 19-27) and AH22 (MATa leu2-3 leu2-112 his4-519) (Hinnen et al. (1978) Transformation of yeast. Proc. Natl. Acad. Sci. USA 75: 1929-1933).

Three sterile 500 ml conical flasks, each containing 100 ml sterile YPD broth (Sigma, Cat No. Y-1375), were inoculated with sterile 10 μl loops of differing quantities of the S. cerevisiae strain 11C from a freshly streaked YPD plate (Sigma, Cat No. Y-1500), and grown in an orbital shaker at 30° C., 200 rpm, overnight. The growth of 11C in the three flasks was measured by absorbance at 600 nm. One flask was deemed to be at the late exponential growth phase (1.98 ODU ml at 600 nm), and this culture was used to inoculate (50 ml overnight culture per flask) 2 identical 5 L sterile conical flasks (labeled E and L), each containing 1 L sterile YPD broth, to a final concentration of ~0.1 ODU ml. Flasks E and L were grown in an orbital shaker at 30° C., 200 rpm. 10 ml samples were collected at the times indicated below (Table 1). The samples were treated as follows: their growth was determined (A600nm), the possibility of contamination was checked (using a light microscope), cells were harvested in a benchtop centrifuge (~2000×g for 5 minutes), and the supernatant was removed and frozen at −20° C. (samples labeled E0-E3, and L0-L5).

TABLE 1 Growth of cultures E and L as measured by absorbance at 600 nm.

Time Point   Time after inoculation (min)   Growth of flask E (ODU)   Growth of flask L (ODU)
T0           0                              0.099                     0.099
T1           310                            0.37                      0.36
T2           410                            0.71                      0.72
T3           455                            0.97                      0.92
T4           775                            -                         3.64
T5           1420                           -                         6.05

After 455 minutes, a time deemed to be late exponential growth phase in glucose, flask E (i.e., early) was harvested (~2000×g for 5 minutes), split into 50 ml aliquots, and frozen at −80° C. After 1420 minutes, a time deemed to be growth on ethanol, flask L (i.e., late) was harvested (~2000×g for 5 minutes), split into 50 ml aliquots, and frozen at −80° C.
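The absorbance readings in Table 1 can be turned into an approximate doubling time. The sketch below assumes simple exponential growth between two time points, which is an assumption of this illustration rather than an analysis performed in the example; the function names are hypothetical.

```python
import math

# (minutes after inoculation, absorbance at 600 nm) for flask L, from Table 1.
FLASK_L = [(0, 0.099), (310, 0.36), (410, 0.72), (455, 0.92), (775, 3.64), (1420, 6.05)]


def specific_growth_rate(t1, od1, t2, od2):
    """Growth rate per minute, assuming exponential growth between two points."""
    return math.log(od2 / od1) / (t2 - t1)


def doubling_time(rate_per_min):
    """Minutes needed for the culture density to double at the given rate."""
    return math.log(2) / rate_per_min


# Interval T1 to T3 (310 to 455 minutes), while glucose is still abundant.
mu = specific_growth_rate(*FLASK_L[1], *FLASK_L[3])
td = doubling_time(mu)
print(round(td), "minutes per doubling")
```

Applying the same calculation to the T4 to T5 interval gives a much longer doubling time, consistent with the slower growth on ethanol after glucose is exhausted.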

Determination of Glucose and Ethanol Concentration

Supernatant samples (E0-E3, and L0-L5) were defrosted, and their ethanol and glucose contents were measured using ethanol (Boehringer, Cat. No. 176290) and glucose (Boehringer, Cat. No. 176251) detection kits according to the manufacturer's instructions. The concentrations determined are shown below in Table 2.

TABLE 2 Glucose and Ethanol concentrations in supernatants of cultures E and L at different time points.

Sample   Time after inoculation (min)   Glucose level in media (g L⁻¹)   Ethanol level in media (g L⁻¹)
E0       0                              20.0                             0.0
E1       310                            21.8                             0.3
E2       410                            21.8                             0.8
E3       455                            21.2                             0.87
L0       0                              20.0                             0.0
L1       310                            22.2                             0.36
L2       410                            22.0                             0.62
L3       455                            20.0                             0.87
L4       775                            11.8                             5.2
L5       1420                           0.0                              11.8

It can be seen in Table 2 that at the point of culture harvest for E (E3, 455 minutes), the cells were still utilizing glucose as a carbon source, while at the point of culture harvest for L (L5, 1420 minutes), glucose was exhausted, and the cells were utilizing ethanol as a carbon source. Calibration values used to calculate glucose concentrations are shown in Table 3.

Calibration values used to calculate ethanol concentrations are shown in Table 4.

TABLE 3 Glucose standards

GLUCOSE STANDARDS (g/l)   OD A340
0                         0
0.2                       0.246
0.4                       0.461
0.6                       0.726
0.8                       0.967
1                         1.227

TABLE 4 Ethanol standards

ETHANOL STANDARDS (g/L)   OD A340
4.72                      0.041
9.44                      0.083
18.88                     0.166
37.76                     0.322
56.6                      0.534
75.5                      0.664
94.4                      0.846
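One common way to use calibration values such as those in Table 3 is a least-squares straight line through the standards, which is then inverted to convert a sample absorbance into a concentration. The source does not state how the calibration was applied, so the fitting method, the function names, and the `dilution_factor` parameter below are all assumptions of this sketch.

```python
# (glucose concentration in g/l, absorbance at 340 nm), from Table 3.
GLUCOSE_STANDARDS = [
    (0.0, 0.0), (0.2, 0.246), (0.4, 0.461),
    (0.6, 0.726), (0.8, 0.967), (1.0, 1.227),
]


def fit_line(points):
    """Ordinary least-squares fit of absorbance = slope * conc + intercept."""
    n = len(points)
    mean_x = sum(x for x, _ in points) / n
    mean_y = sum(y for _, y in points) / n
    sxx = sum((x - mean_x) ** 2 for x, _ in points)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in points)
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def concentration(absorbance, slope, intercept, dilution_factor=1.0):
    """Invert the calibration line; dilution_factor is a hypothetical parameter."""
    return (absorbance - intercept) / slope * dilution_factor


slope, intercept = fit_line(GLUCOSE_STANDARDS)
# Reading back one of the standards as a sanity check.
print(round(concentration(0.726, slope, intercept), 2), "g/l")
```

Because the standards span 0 to 1 g/l while the culture supernatants were reported at up to about 22 g/l, samples would need to be diluted into the calibrated range before reading; the dilution actually used is not stated in the example.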

EXAMPLE 2 Analysis of RNA Levels From Yeast Dimorphic Growth Samples

Total RNA Isolation

Total RNA was isolated from 300 ml of culture using the hot phenol protocol. The frozen yeast pellets were resuspended in lysis buffer (4 ml) (0.5 ml Tris-Cl (1 M, pH 7.5), 1.0 ml EDTA (0.5 M), 2.5 ml 10% SDS, and 46.0 ml ddH₂O), and an equal volume of acid phenol was added and vortexed. Following incubation at 65° C. for one hour (with occasional vigorous vortexing) the mixture was placed on ice for 10 minutes then centrifuged (10 minutes). The aqueous layer was transferred to a fresh centrifuge tube and mixed with an equal volume of phenol at room temperature. The mixture was centrifuged and an equal volume of chloroform was mixed with the aqueous layer in a fresh centrifuge tube. Following centrifugation the aqueous layer was transferred to a fresh centrifuge tube, and sodium acetate (to a final concentration of 0.3 M) and two volumes of 100% ethanol were added to precipitate the RNA. The mixture was placed at −20° C. for 30 minutes then centrifuged for 10 minutes to pellet the RNA. The RNA pellet was washed 2-3 times with 70% ethanol then allowed to dry at room temperature. The pellet was resuspended in ddH₂O (200-500 μl). The RNA was quantitated by measuring OD 260-280. Yield of total RNA was ~4.5 mg from each culture.
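The lysis buffer recipe lists stock volumes rather than final concentrations. A short calculation using C1·V1 = C2·V2 recovers the working concentrations; the helper name is hypothetical and the arithmetic uses only the volumes and stock strengths given in the recipe.

```python
def final_concentration(stock_conc, stock_volume_ml, total_volume_ml):
    """C2 = C1 * V1 / V2, in the same units as stock_conc."""
    return stock_conc * stock_volume_ml / total_volume_ml


# Component volumes in ml, as listed for the lysis buffer.
components_ml = {"Tris-Cl 1 M": 0.5, "EDTA 0.5 M": 1.0, "SDS 10%": 2.5, "ddH2O": 46.0}
total_ml = sum(components_ml.values())

tris_mM = final_concentration(1000.0, components_ml["Tris-Cl 1 M"], total_ml)
edta_mM = final_concentration(500.0, components_ml["EDTA 0.5 M"], total_ml)
sds_percent = final_concentration(10.0, components_ml["SDS 10%"], total_ml)

print(total_ml, "ml total")
print(tris_mM, "mM Tris-Cl;", edta_mM, "mM EDTA;", sds_percent, "% SDS")
```

The recipe therefore makes 50 ml of buffer, of which 4 ml is used to resuspend the pellets.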

Poly A+ RNA Purification

Poly A+ RNA was purified from total RNA using the Qiagen Oligotex mRNA Midi Kit (Qiagen, Cat. No. 70042). 2 mg of total RNA was used as starting material and made up to a volume of 500 μl with DEPC treated H₂O. To this, 500 μl buffer OBB (2× binding buffer) and 55 μl Oligotex suspension were added. The “Oligotex mRNA Spin Column Protocol” from the kit protocol booklet was followed. The pelleted mRNA was washed in 200 μl 75% ethanol, dried and resuspended in 10 μl DEPC treated H₂O. Yield of Poly A+ RNA was ~8 μg for each sample.

cDNA Synthesis

cDNA was synthesized following the protocol in the GeneChip Expression Analysis Manual, using reagents from the Gibco BRL Life Technologies SuperScript Choice System (cat. No. 18090-019). For each sample 5 μg Poly A+ RNA was added to 100 pmol of T7-(dT)₂₄ primer (sequence: GGCCAGTGAATTGTAATACGACTCACTATAGGGAGGCGG-(T)₂₄, HPLC purified) (SEQ ID NO:15) in a total of 8 μl (made up to volume with DEPC treated H₂O). The reaction mixture was incubated for 10 minutes at 70° C. in a Perkin Elmer PE9600 thermal cycler then put on ice. The following reagents were added to the reaction mixture: 4 μl 5× first strand cDNA buffer; 2 μl 0.1 M DTT; and 1 μl 10 mM dNTP mix. The reaction mixture was mixed and incubated at 37° C. for 2 minutes in a Perkin Elmer PE9600 thermal cycler. 5 μl SuperScript II reverse transcriptase was then added. The mixture was incubated at 37° C. for 1 hour in a Perkin Elmer PE9600 thermal cycler.

The first strand cDNA reaction was placed on ice and the following reagents added: 91 μl DEPC treated H₂O; 30 μl 5× second strand reaction buffer; 3 μl 10 mM dNTP mix; 1 μl 10 units/μl E. coli DNA ligase; 4 μl 10 units/μl E. coli DNA Polymerase I; and 1 μl 2 units/μl RNase H. The mixture was incubated at 16° C. for 2 hours in a Perkin Elmer PE9600 thermal cycler. 2 μl 5 units/μl T4 DNA Polymerase was then added. The mixture was incubated for a further 5 minutes at 16° C. in a Perkin Elmer PE9600 thermal cycler. 10 μl 0.5 M EDTA was then added.

The double stranded DNA was cleaned up by phenol extraction. The reaction product was transferred to a 1.5 ml eppendorf tube and 162 μl Tris pH 8.0 saturated phenol was added. The tube was mixed by vortexing and then centrifuged in a microfuge at 13,000 rpm for 5 minutes. The top fraction was recovered and cDNA precipitated by addition of 60 μl 7.5 M ammonium acetate plus 400 μl absolute ethanol. This was immediately centrifuged in a microfuge at 13,000 rpm for 20 minutes. The supernatant fraction was discarded, the pellet was washed in 75% ethanol and then air-dried. The pellet was resuspended in 20 μl DEPC treated H₂O.
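The reagent volumes listed for first and second strand synthesis can be tallied to confirm that the 162 μl of phenol corresponds to one reaction volume. The sketch below only sums the volumes stated in the text; the dictionary labels are shorthand chosen for this illustration.

```python
# Volumes in microlitres, as listed in the cDNA synthesis steps.
FIRST_STRAND_UL = {
    "poly A+ RNA plus primer": 8,
    "5x first strand buffer": 4,
    "0.1 M DTT": 2,
    "10 mM dNTP mix": 1,
    "SuperScript II": 5,
}
SECOND_STRAND_ADDITIONS_UL = {
    "DEPC treated water": 91,
    "5x second strand buffer": 30,
    "10 mM dNTP mix": 3,
    "E. coli DNA ligase": 1,
    "E. coli DNA polymerase I": 4,
    "RNase H": 1,
}
FINAL_ADDITIONS_UL = {"T4 DNA polymerase": 2, "0.5 M EDTA": 10}

first_strand_total = sum(FIRST_STRAND_UL.values())
second_strand_total = first_strand_total + sum(SECOND_STRAND_ADDITIONS_UL.values())
final_total = second_strand_total + sum(FINAL_ADDITIONS_UL.values())

# Working strength of each 5x buffer in its reaction.
first_buffer_fold = 5 * FIRST_STRAND_UL["5x first strand buffer"] / first_strand_total
second_buffer_fold = 5 * SECOND_STRAND_ADDITIONS_UL["5x second strand buffer"] / second_strand_total

print(first_strand_total, second_strand_total, final_total)
```

The totals come to 20 μl, 150 μl and 162 μl, so each 5× buffer is at 1× in its reaction and the phenol is added at an equal volume.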

Synthesis of Biotin-Labeled cRNA by In Vitro Transcription (IVT)

Reagents from the Ambion MEGAscript T7 kit, cat. No. 1334, were used for the synthesis of biotin-labeled cRNA by in vitro transcription (IVT). The NTP Labeling mix comprised 7.5 mM ATP; 7.5 mM GTP; 5.625 mM UTP; 1.875 mM Biotin-16-UTP (Enzo cat No. 42814); 5.62 mM CTP; and 1.875 mM Biotin-11-CTP (Enzo cat No. 42818). The IVT Labeling reaction comprised: 14.5 μl NTP Labeling mix; 2 μl 10× Ambion Transcription Buffer; 1.5 μl double strand cDNA (from above); and 2 μl Ambion T7 Enzyme Mix.
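The composition of the labeling reaction can be checked with a short calculation. The sketch below uses only the concentrations and volumes stated above; the variable names are illustrative. It reports the total reaction volume, the fraction of the UTP pool that is biotinylated, and the final ATP concentration after dilution into the reaction.

```python
from decimal import Decimal as D

# Concentrations (mM) of the NTP Labeling mix as listed in the text.
ntp_mix_mM = {
    "ATP": D("7.5"),
    "GTP": D("7.5"),
    "UTP": D("5.625"),
    "Biotin-16-UTP": D("1.875"),
    "CTP": D("5.62"),
    "Biotin-11-CTP": D("1.875"),
}

# Volumes (microlitres) of the IVT Labeling reaction components.
reaction_ul = {
    "NTP Labeling mix": D("14.5"),
    "10x Transcription Buffer": D("2"),
    "double strand cDNA": D("1.5"),
    "T7 Enzyme Mix": D("2"),
}

reaction_volume = sum(reaction_ul.values())

# Share of the uridine nucleotide pool that carries biotin.
biotin_utp_fraction = ntp_mix_mM["Biotin-16-UTP"] / (
    ntp_mix_mM["UTP"] + ntp_mix_mM["Biotin-16-UTP"]
)

# Final ATP concentration once the mix is diluted into the full reaction.
final_atp_mM = ntp_mix_mM["ATP"] * reaction_ul["NTP Labeling mix"] / reaction_volume

print(reaction_volume, biotin_utp_fraction, final_atp_mM)
```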

The reaction mixture was incubated for 6 hours at 37° C. in a Perkin Elmer PE9600 thermal cycler. The biotinylated cRNA was cleaned up using the Qiagen RNeasy kit, cat No. 74103. The RNeasy kit protocol was followed exactly. RNA was eluted in 2 aliquots of 30 μl DEPC treated H₂O. The RNA was precipitated by addition of 6 μl 3M sodium acetate pH 5.5 plus 75 μl absolute ethanol. The RNA was allowed to precipitate overnight at −20° C. Samples were centrifuged in a microfuge at 13,000 rpm for 20 minutes to pellet the RNA. The supernatant fraction was discarded and the pellet was washed in 1 ml of 75% ethanol and then allowed to air dry. The pellet was then resuspended in 20 μl DEPC treated H₂O. The yield of RNA was ˜40 μg for each sample.

RNA Fragmentation

11 μg of cRNA was fragmented. 8 μl of 5× Fragmentation buffer (200 mM Tris-Acetate pH 8.1, 500 mM potassium acetate, 150 mM magnesium acetate) plus 11 μg cRNA made up to 20 μl with DEPC treated H₂O was used. The reaction mixture was incubated at 94° C. for 35 minutes in a Perkin Elmer PE9600 thermal cycler.

Hybridization to GeneChip Microarray

The hybridization mix comprised: 20 μl (11 μg) of fragmented cRNA; 2.2 μl of control oligo B2 (50 pmol/μl) (sequence: 5′ Biotin-GTCAAGATGCTACCGTTCAG 3′, HPLC purified) (SEQ ID NO:16); 2.2 μl Herring Sperm DNA (10 mg/ml); 110 μl 2× Buffer (2M NaCl, 20 mM Tris pH 7.6, 0.01% Triton X-100); and 85.6 μl DEPC treated H₂O. The hybridization mix was heated to 95° C. in a Techne hot block for 5 minutes, followed by incubation at 40° C. for 5 minutes. The hybridization mix was clarified by centrifugation in a microfuge at 13,000 rpm for 5 minutes.
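The component volumes of the hybridization mix can be summed to confirm that the 2× Buffer ends up at working strength. This is a minimal sketch using only the volumes given above, with illustrative variable names. It also shows the volume left over once 200 μl has been loaded onto a cartridge.

```python
from decimal import Decimal as D

# Volumes (microlitres) of the hybridization mix components.
hyb_mix_ul = {
    "fragmented cRNA": D("20"),
    "control oligo B2": D("2.2"),
    "herring sperm DNA (10 mg/ml)": D("2.2"),
    "2x Buffer": D("110"),
    "DEPC treated water": D("85.6"),
}

hyb_total = sum(hyb_mix_ul.values())

# Final strength of the 2x Buffer in the complete mix.
buffer_fold = 2 * hyb_mix_ul["2x Buffer"] / hyb_total

# Final herring sperm DNA concentration in mg/ml.
herring_mg_per_ml = D("10") * hyb_mix_ul["herring sperm DNA (10 mg/ml)"] / hyb_total

# Volume remaining in the tube after 200 microlitres is loaded on the cartridge.
remainder_ul = hyb_total - D("200")

print(hyb_total, buffer_fold, herring_mg_per_ml, remainder_ul)
```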

200 μl of supernatant was added to the GeneChip cartridge (the GeneChip cartridge was previously pre-wetted with 200 μl 1× Buffer and incubated for 10 minutes at 40° C. in the rotisserie box of a GeneChip hybridization oven 320 (cat No. 800227) at maximum rpm). The sample was hybridized to the microarray overnight at 40° C. in a GeneChip hybridization oven in the rotisserie at maximum rpm.

Washing and Staining of Probe Arrays

The hybridization mix was recovered from the GeneChip cartridge and put back in the tube containing the remainder of the sample. 200 μl 6×SSPE-T (6×SSPE plus 0.005% Triton X-100) was applied to the chip and pipetted in and out twice. This process was repeated twice more. Another 200 μl 6×SSPE-T was applied to the cartridge and the cartridge was then incubated for 1 hour at 50° C. at maximum rpm in the GeneChip hybridization oven. The 6×SSPE-T was removed and 200 μl 0.5×SSPE-T was added to the cartridge. The cartridge was incubated for 15 minutes at 50° C. at maximum rpm in the GeneChip hybridization oven. The 0.5×SSPE-T was removed and the cartridge was re-filled with 200 μl 6×SSPE-T.

The stain solution comprised: 190 μl 6×SSPE-T; 10 μl of 20 mg/ml acetylated BSA; and 2 μl mg/ml conjugated streptavidin:phycoerythrin (Molecular Probes cat. No. S-866). 200 μl 6×SSPE-T was removed from the GeneChip cartridge and 200 μl of stain solution was added. The cartridge was incubated at ambient temperature in a GeneChip hybridization oven at maximum rpm in the rotisserie for 10 minutes. The stain solution was removed and the cartridge was washed by adding 200 μl 6×SSPE-T and pipetting this in and out of the cartridge. This process was repeated six times. The cartridges were then completely filled with 6×SSPE-T and any bubbles were removed. Hybridization, washing and staining were repeated using the same hybridization mixes until both samples had been hybridized to each of the four yeast chip sub-set arrays.

Data Collection

Data was collected by scanning the hybridized chips on a Hewlett-Packard GeneArray scanner. A “halo” effect (appearance of stain non-specifically across the array image) was seen on one of the scanned images: yeast growing in glucose rich media, sub-set C array. Scanning of this array was aborted after one scan and the chip was washed twice with 200 μl 6×SSPE-T and then re-filled as before. This array was then re-scanned three times and the data collected was the average of these three scans. All other arrays were scanned four times without problems and the data collected was the average of the four scans.

EXAMPLE 3 Isolation of Promoters and Construction of Expression Vectors.

PCR Amplification of Promoter Regions from Genomic DNA

Based on the Saccharomyces cerevisiae genomic sequence in the GenEMBL nucleotide database, oligonucleotide primers were designed to amplify the genomic sequence 5′ to the following ORFs: YLR110C (Johnson et al. (1997) Nature May 29;387(6632 Suppl):87-90), YMR251WA (common name HOR7) (Bowman et al. (1997) Nature May 29;387(6632 Suppl):90-3), YMR107W (Bowman et al. (1997) Nature May 29;387(6632 Suppl):90-3), and YOL109W (common name ZEO1) (Dujon et al. (1997) Nature May 29;387(6632 Suppl):98-102). The region amplified was the non-coding region separating the selected ORF and the next predicted Saccharomyces cerevisiae ORF in the 5′ direction, with a minimum length of 500 bp.

Sequence of Oligonucleotide Primers used to Amplify Promoter DNA

YLR110C-F    ATGCAAGCTTCGCGGCCGCCGTCTGATTTCCGTTT      SEQ ID NO:5
YLR110C-R    CCAGGCCGCATATGTCATATAGTGTTTAAG           SEQ ID NO:6
YMR251WA-F   AGCTAAGCTTCGCGGCCGCCTTTCGATTAGCACGCAC    SEQ ID NO:7
YMR251WA-R   AGATACCTTCATATGTTATTATTAGTC              SEQ ID NO:8
YMR107W-F    AGCTAAGCTTCGCGGCCGCGCAGAAATGATGAAGG      SEQ ID NO:9
YMR107W-R    ATCCATCCCATATGTGATATCTCGATTAG            SEQ ID NO:10
ZEO1-F       AGCTAAGCTTCGCGGCCGCGGAGGTCTGCTTCACG      SEQ ID NO:11
ZEO1-R       TACGATCGCATATGTAATTGATATAAACG            SEQ ID NO:12
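The primers carry the restriction sites later used for cloning. As an illustration, the following sketch scans each primer for the HindIII (AAGCTT), NotI (GCGGCCGC) and NdeI (CATATG) recognition sequences. The helper name sites_in is hypothetical; the primer sequences are those listed above.

```python
# Recognition sequences of the enzymes used in the cloning strategy.
SITES = {"HindIII": "AAGCTT", "NotI": "GCGGCCGC", "NdeI": "CATATG"}

PRIMERS = {
    "YLR110C-F": "ATGCAAGCTTCGCGGCCGCCGTCTGATTTCCGTTT",
    "YLR110C-R": "CCAGGCCGCATATGTCATATAGTGTTTAAG",
    "YMR251WA-F": "AGCTAAGCTTCGCGGCCGCCTTTCGATTAGCACGCAC",
    "YMR251WA-R": "AGATACCTTCATATGTTATTATTAGTC",
    "YMR107W-F": "AGCTAAGCTTCGCGGCCGCGCAGAAATGATGAAGG",
    "YMR107W-R": "ATCCATCCCATATGTGATATCTCGATTAG",
    "ZEO1-F": "AGCTAAGCTTCGCGGCCGCGGAGGTCTGCTTCACG",
    "ZEO1-R": "TACGATCGCATATGTAATTGATATAAACG",
}


def sites_in(sequence):
    """Return the names of the recognition sites present in a primer."""
    return [name for name, site in SITES.items() if site in sequence]


for primer_name, primer_seq in PRIMERS.items():
    print(primer_name, len(primer_seq), sites_in(primer_seq))
```

Each forward primer carries HindIII and NotI sites at its 5′ end, and each reverse primer carries an NdeI site, which is what allows the PCR products to replace the PRB1 promoter as a HindIII/NdeI fragment.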

PCR reactions were set up for each primer pair as follows. For YMR251WA and ZEO1, 90 μl of Reddy-Load PCR (1.1X) mix, 3.5 mM MgCl₂ (Advanced Biotechnologies, cat. no. AB-0628); 2 μl of forward primer (100 μM); 2 μl of reverse primer (100 μM); 1 μl of S. cerevisiae genomic DNA (Promega G310A, lot 8347702, 276 μg/ml); and 5 μl of H₂O were combined.

For YLR110C and YMR107W, 90 μl of Reddy-Load PCR (1.1X) mix, 1.5 mM MgCl₂ (Advanced Biotechnologies, cat. no. AB-0575); 2 μl of forward primer (100 μM); 2 μl of reverse primer (100 μM); 1 μl of S. cerevisiae genomic DNA (Promega G310A, lot 8347702, 276 μg/ml); and 5 μl of H₂O were combined.
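The final concentrations in each PCR reaction follow directly from the listed volumes. The sketch below is a worked calculation using only those figures, with illustrative variable names.

```python
from decimal import Decimal as D

# Volumes (microlitres) common to both PCR reaction set-ups.
pcr_ul = {
    "1.1X PCR mix": D("90"),
    "forward primer (100 uM)": D("2"),
    "reverse primer (100 uM)": D("2"),
    "genomic DNA (276 ug/ml)": D("1"),
    "water": D("5"),
}

pcr_total = sum(pcr_ul.values())

# Final strength of the 1.1X mix after dilution to the full volume.
mix_fold = D("1.1") * pcr_ul["1.1X PCR mix"] / pcr_total

# Final concentration of each primer in micromolar.
primer_final_uM = D("100") * pcr_ul["forward primer (100 uM)"] / pcr_total

# Mass of template per reaction: 276 ug/ml is 276 ng per microlitre.
template_ng = D("276") * pcr_ul["genomic DNA (276 ug/ml)"]

print(pcr_total, mix_fold, primer_final_uM, template_ng)
```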

The thermocycling was carried out as follows. For the YMR251WA promoter: 94° C. for 5 minutes, followed by 30 cycles of 94° C. for 30 seconds, 60° C. for 30 seconds, and 72° C. for 1 minute, followed by 72° C. for 5 minutes. The reaction mixtures were then held at 4° C. For the YMR107W and ZEO1 promoters: 94° C. for 5 minutes, followed by 30 cycles of 94° C. for 30 seconds, 45° C. for 30 seconds, and 72° C. for 1 minute, followed by 72° C. for 5 minutes. The reaction mixtures were then held at 4° C. For the YLR110C promoter: 94° C. for 5 minutes, followed by 30 cycles of 94° C. for 30 seconds, 50° C. for 30 seconds, and 72° C. for 1 minute, followed by 72° C. for 5 minutes. The reaction mixtures were then held at 4° C.
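The three programs differ only in annealing temperature. One way to express this is as a single parameterized program, sketched below. The function names are hypothetical, and the calculation covers programmed hold times only (ramping between temperatures and the final 4° C. hold are not included).

```python
# Annealing temperature (degrees C) used for each promoter.
ANNEAL_C = {"YMR251WA": 60, "YMR107W": 45, "ZEO1": 45, "YLR110C": 50}


def build_program(anneal_c, cycles=30):
    """Return the program as a list of (temperature C, seconds) steps."""
    steps = [(94, 300)]  # initial denaturation, 5 minutes
    for _ in range(cycles):
        steps += [(94, 30), (anneal_c, 30), (72, 60)]
    steps.append((72, 300))  # final extension, 5 minutes
    return steps


def programmed_minutes(steps):
    """Sum of the programmed hold times, in minutes."""
    return sum(seconds for _, seconds in steps) / 60


for promoter, anneal_c in ANNEAL_C.items():
    program = build_program(anneal_c)
    print(promoter, anneal_c, len(program), programmed_minutes(program))
```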

The PCR solutions were loaded onto an LMP gel and the bands were purified using Wizard PCR preps (Promega, cat. no. A7170) according to protocol, eluted in 50 μl, ethanol precipitated, and resuspended in 20 μl. A map of the YLR110C promoter region is shown in FIG. 13 and SEQ ID NO:29. A map of the YMR251WA promoter region is shown in FIG. 14 and SEQ ID NO:30. A map of the YMR107W promoter region is shown in FIG. 15 and SEQ ID NO:31. A map of the ZEO1 promoter region is shown in FIG. 16 and SEQ ID NO:32.

Cloning Promoter Regions Into a Yeast Vector Containing the Luciferase Gene

The PCR products representing the regions upstream of the YLR110C and YMR251WA ORFs were cloned into the suitably digested YEp13-based multicopy yeast expression vector pPRB1P+luc. A map of YEp13 is shown in FIG. 1. The Accession number for YEp13 is U03498. A map of pPRB1P is shown in FIG. 2. The sequence of pPRB1P is shown in SEQ ID NO:27. A map of pPRB1P+luc is shown in FIG. 3 and the sequence is shown in SEQ ID NO:28. The PRB1 promoter was removed from the vector by digesting with the restriction enzymes HindIII and NdeI. The digested backbone was then ligated with a HindIII/NdeI digested PCR product. See FIG. 4.

The PCR products described below, and maxi-prepped pPRB1P+luc, were digested as follows. pPRB1P+luc: 60 μl of pPRB1P+luc (328 μg/ml), 10 μl of Hind III (Life Technologies, cat.no. 15207-012, 10 units/μl), 10 μl Nde I (Amersham, cat.no. E0216Y, 20 units/μl), 10 μl NEBuffer 2 (NEB, cat.no. 007-2), and 10 μl of H₂O. YLR110C: 14 μl YLR110C, 2 μl of Hind III (Life Technologies, cat.no. 15207-012, 10 units/μl), 2 μl Nde I (Amersham, cat.no. E0216Y, 20 units/μl), and 2 μl NEBuffer 2 (NEB, cat.no. 007-2). YMR251WA: 14 μl YMR251WA, 2 μl of Hind III (Life Technologies, cat.no. 15207-012, 10 units/μl), 2 μl Nde I (Amersham, cat.no. E0216Y, 20 units/μl), and 2 μl NEBuffer 2 (NEB, cat.no. 007-2). The solutions were allowed to react at 37° C. for hours.

The double digested pPRB1P+luc backbone was purified on an LMP gel using Wizard PCR preps (Promega, cat. no. A7170), and then ethanol precipitated. The remaining digestion products were also ethanol precipitated. The pPRB1P+luc digests were resuspended in 60 μl of H₂O and the PCR product digests were resuspended in 20 μl.

Ligation reactions were then carried out between each promoter region and the digested pPRB1P+luc at 16° C. overnight. The PCR products representing the regions upstream of the following ORFs, YMR107W and ZEO1, were prepared, restricted, and ligated essentially as described above; however, BCL restriction buffer B and different amounts of PCR product/volumes were used.

Transformation of Ligation Products into E. coli

The products of the ligations described above were transformed into E. coli (Invitrogen's One-Shot TOP10 Competent cells, cat.no. C4040-10) according to the manufacturer's protocol. In each case 5 μl of the ligation product was added to the cell suspension. The total final cell suspension was plated out onto L-amp plates and incubated overnight at 37° C.

Colonies were picked from the plates and PCR screened using the PCR primers used to amplify the promoters originally. Two positive colonies from each ligation were grown in 5 ml overnight cultures and their plasmids were purified (Promega Wizard Plus SV Mini-preps, cat. no. A1330). The eluted DNA was ethanol precipitated and resuspended in 20 μl of water. Analytical restriction digests were carried out to confirm the presence of the correct promoter. Clones containing all four promoter constructs were obtained.

The new constructs were named as follows:

pPRB1P+luc backbone + YLR110C promoter  = pYLR110P+luc   SEQ ID NO:19
pPRB1P+luc backbone + YMR251WA promoter = pYMR251AP+luc  SEQ ID NO:20
pPRB1P+luc backbone + YMR107W promoter  = pYMR107P+luc   SEQ ID NO:21
pPRB1P+luc backbone + ZEO1 promoter     = pZEO1P+luc     SEQ ID NO:22

Maps of pYLR110P+luc, pYMR251AP+luc, pYMR107P+luc, and pZEO1P+luc are shown in FIGS. 5, 6, 7, and 8, respectively. Plasmid DNA (pYLR110P+luc and pYMR251AP+luc) was prepared for transformation into yeast and sequencing using the QIAGEN Plasmid Maxi kit (Cat.no. 12162). The DNA concentrations of the maxi-preps (measured by absorbance at 260 nm) were: pYLR110P+luc 463 μg/ml; pYMR251AP+luc 346 μg/ml; pYMR107P+luc ˜300 μg/ml; and pZEO1P+luc ˜720 μg/ml. The remaining plasmids were transformed into yeast as Wizard Plus SV Mini-prep DNA, and maxi-prep DNA was obtained for sequencing using the Gibco BRL Concert Plasmid Maxi kit (Cat no. 11452).

Sequencing of Promoter Constructs

DNA of each of the four promoter constructs was sequenced. The ABI PRISM BigDye Terminator Cycle Sequencing Kit (PE Applied Biosystems, part no. 4303153) was used to carry out the sequencing reactions. Each reaction contained 8 μl of Reaction Mix and 1 μl of 3.2 μM primer. The volumes of template DNA and H₂O added were as follows: 1.1 μl of pYLR110P+luc template and 9.9 μl of water; 1.4 μl of pYMR251AP+luc template and 9.6 μl of water; 2.0-6.0 μl of pYMR107P+luc template and 9.0-5.0 μl of water; and 0.5-1.5 μl of pZEO1P+luc template and 10.5-9.5 μl of water.
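In every case the template and water volumes are balanced so that they sum to the same amount, giving a constant reaction volume. The sketch below verifies this from the figures above; the list layout and names are illustrative, and the range endpoints for the last two templates are treated as separate cases.

```python
from decimal import Decimal as D

REACTION_MIX_UL = D("8")
FIXED_ADDITION_UL = D("1")  # the 1 microlitre addition at 3.2 uM

# (template name, template microlitres, water microlitres)
template_and_water = [
    ("pYLR110P+luc", D("1.1"), D("9.9")),
    ("pYMR251AP+luc", D("1.4"), D("9.6")),
    ("pYMR107P+luc (low)", D("2.0"), D("9.0")),
    ("pYMR107P+luc (high)", D("6.0"), D("5.0")),
    ("pZEO1P+luc (low)", D("0.5"), D("10.5")),
    ("pZEO1P+luc (high)", D("1.5"), D("9.5")),
]

# Total volume of each sequencing reaction.
totals = {
    name: REACTION_MIX_UL + FIXED_ADDITION_UL + template + water
    for name, template, water in template_and_water
}

for name, total in totals.items():
    print(name, total)
```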

The thermocycling protocol is described in the ABI protocol. The PCR products were ethanol precipitated by adding 3M NaOAc and absolute ethanol, standing at room temperature for 15 minutes, centrifuging for 20 minutes and washing with 250 μl of 70% ethanol. The precipitated DNA was resuspended in 3 μl of loading dye and 2 μl of each suspension was analyzed on a PE-ABI 377 automated sequencer.

The promoter constructs pYLR110P+luc and pYMR251AP+luc were each sequenced using four primers:

Yep13 F2: CCTCAATTGGATTAGTCTCA (SEQ ID NO:13); aligns to the Yep13 backbone, 290 bp 5′ of the Hind III site.

Luc R1: CACCTCGATATGTGCATCTG (SEQ ID NO:14); aligns to the Luc ORF, 150 bp 3′ of the NdeI site.

Forward PCR primer: forward primer used to PCR clone promoter, i.e., SEQ ID NO:5 and SEQ ID NO:7.

Reverse PCR primer: reverse primer used to PCR clone promoter, i.e., SEQ ID NO:6 and SEQ ID NO:8.

The remaining promoter constructs (pYMR107P+luc and pZEO1P+luc) were each sequenced using primers Yep13 F2 and Luc R1. Combining the data from all primers completely sequenced the promoter regions and spanned the cloning sites of the original vector.

Deviations from Published Genomic Sequences

All sequences differ by a few base pairs around the ATG; this results from the creation of an NdeI site at the 3′ end of the promoter. In addition, the following further alterations from published sequences were identified.

pYMR107P+luc: In the initial construct (for which luciferase reporter data is described), a cloning artifact led to the junction between the promoter region and the LUC ORF in pYMR107P+luc having the sequence CATATATG (where ATG is the luciferase translational start site). This sequence was modified by site directed mutagenesis to create the sequence CATATG, which generates a novel NdeI site at the promoter/luciferase junction. Subsequent luciferase expression analysis confirmed that expression from the NdeI site modified pYMR107P+luc construct did not differ significantly from the original construct; therefore the sequence of the corrected CATATG construct is included herein.
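The effect of the two extra bases at the junction can be illustrated with a simple string check. In this sketch the NdeI recognition sequence is CATATG; the variable names are illustrative.

```python
NDEI_SITE = "CATATG"

original_junction = "CATATATG"   # junction produced by the cloning artifact
corrected_junction = "CATATG"    # junction after site directed mutagenesis

# The artifact junction does not contain an NdeI recognition sequence.
original_has_ndei = NDEI_SITE in original_junction

# The corrected junction is itself an NdeI site.
corrected_has_ndei = NDEI_SITE in corrected_junction

# Both junctions still end in the ATG start codon.
both_end_in_atg = original_junction.endswith("ATG") and corrected_junction.endswith("ATG")

print(original_has_ndei, corrected_has_ndei, both_end_in_atg)
```

The two additional bases (AT) disrupt the recognition sequence while leaving the ATG intact, which is consistent with the report that expression did not differ significantly between the two constructs.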

Other Modifications

pYMR107P+luc: Cloning artifacts created an additional HindIII site and linker to the 5′ (i.e., outside) of the pYMR107P+luc and promoters:

Instead of:

HindIII NotI promoter 5′

AAGCTT-CGCGGCCGCG-NNNNNNN SEQ ID NO:17

The sequence is:

HindIII HindIII NotI promoter 5′

AAGCTT-AGCT-AAGCTT-CGCGGCCGCG-NNNNNNN SEQ ID NO:18.
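The difference between the two arrangements can be shown by counting HindIII recognition sequences (AAGCTT) in each. This is a minimal illustration using the sequences of SEQ ID NO:17 and SEQ ID NO:18 with the variable promoter bases (N) omitted; the variable names are illustrative.

```python
HINDIII_SITE = "AAGCTT"
NOTI_SITE = "GCGGCCGC"

expected_5prime = "AAGCTT" + "CGCGGCCGCG"
observed_5prime = "AAGCTT" + "AGCT" + "AAGCTT" + "CGCGGCCGCG"

# Number of HindIII sites in each arrangement.
expected_hindiii = expected_5prime.count(HINDIII_SITE)
observed_hindiii = observed_5prime.count(HINDIII_SITE)

# Extra bases introduced by the duplicated site plus linker.
extra_bases = len(observed_5prime) - len(expected_5prime)

print(expected_hindiii, observed_hindiii, extra_bases, NOTI_SITE in observed_5prime)
```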

EXAMPLE 4 Luciferase Assays of Promoter Activity

Transformation of S. cerevisiae with Promoter Constructs.

S. cerevisiae strain 11C was transformed with five promoter constructs. This strain carries six metabolic markers: Ade, Trp, Ura, Lys, Leu and His. It has the genotype: ade2-161, trp1-D63, ura3-52, lys2-801, leu2D1 &/or leu2-3 &/or leu2-112, hisD200 &/or his D200. 11C was generated by crossing the strains YPH500 (Mat a ura3-52 lys2-801 ade2-161 trp1-D63 hisD200 leu2D1) and AH22 (MATa leu2-3 leu2-112 his4-519 can1).

11C cells were streaked from a glycerol stock onto a YPD plate and grown at 30° C. for two days. The cells were transformed with the five plasmids: pYLR110P+luc, pYMR251AP+luc, pYMR107P+luc, and pZEO1P+luc, and pPRB1P+luc to act as a control. The transformations were carried out using the Quick and Easy method (Gietz, R. D. and R. A. Woods 1994, Molecular Genetics of Yeast: Practical Approaches pp. 121-134). 10 μl of plasmid was added to the transformation mix in each case. The whole transformation mixes were plated out onto -Leu plates and incubated at 30° C. for three days. Three individual colonies from each transformation plate were picked and used to inoculate 10 ml YPD cultures. The 10 ml cultures were incubated in an orbital shaker set to 200 rpm and 30° C. Cells were harvested from the cultures at two points. First, at a point at which the OD of the culture was close to 1.0, a 4 ml sample was taken. Second, a 3 ml sample was taken after an incubation time of 45 hours. The ODs and incubation times of each sample are shown in Table 5. For all harvested samples, the cells were immediately spun down at 3000 rpm and 4° C., washed in 5 ml of dH₂O, repelleted and frozen at −20° C.

TABLE 5

Plasmid          Clone    OD at time of         Incubation time at     OD at time of
                 number   harvesting first      harvesting of first    harvesting second
                          4 ml sample           sample (hours)         3 ml sample
pPRB1P+luc       7        0.98                  24.5                   4.80
                 8        0.68                  28                     5.56
                 9        1.15                  28                     5.66
pYLR110P+luc     8        1.12                  28                     5.50
                 9        0.48                  28                     4.38
                 10       1.16                  24.5                   5.51
pYMR251AP+luc    8        1.20                  24.5                   4.99
                 9        1.05                  27                     4.71
                 10       1.15                  27                     5.18
pYMR107P+luc     1        1.06                  27                     5.47
                 2        0.49                  28.5                   4.54
                 3        0.97                  25.5                   5.58
pZEO1P+luc       1        1.02                  28.5                   4.84
                 2        0.62                  28.5                   4.97
                 3        0.42                  28.5                   4.31

Analysis of Luciferase Activity

All of the samples were analyzed for luciferase activity, using the LucLite Luciferase Reporter Gene Assay Kit (Packard, cat.no 6016911). The cells were prepared by resuspending in PBS and diluting to a final concentration of 6×10⁶ cells/ml. 100 μl of each cell suspension was pipetted into wells in duplicate on two 96 well plates, so that each well contained 6×10⁵ cells. The plates were incubated at 30° C. for 10 minutes. 100 μl of a 1 in 2 dilution of reconstituted substrate was added to each well, and the plate was further incubated at room temperature for 10 minutes. The luminescence was then measured using the Packard TopCount. The luminescence readings obtained after 0.03 min are shown below in counts per second (CPS) in Table 6.
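The stated cell number per well fixes the volume that must have been dispensed. The sketch below takes the per-well volume of cell suspension as 0.1 ml, which is the only volume consistent with 6×10⁵ cells per well at 6×10⁶ cells/ml, and assumes an equal volume of diluted substrate is added. Variable names are illustrative.

```python
from fractions import Fraction

cells_per_ml = 6 * 10**6
well_volume_ml = Fraction(1, 10)        # cell suspension per well (0.1 ml)
substrate_volume_ml = Fraction(1, 10)   # diluted substrate added per well (assumed equal)
substrate_predilution = Fraction(1, 2)  # "1 in 2 dilution" of reconstituted substrate

# Number of cells delivered to each well.
cells_per_well = cells_per_ml * well_volume_ml

# Overall dilution of the reconstituted substrate once mixed with the cells.
final_substrate_dilution = substrate_predilution * substrate_volume_ml / (
    well_volume_ml + substrate_volume_ml
)

print(int(cells_per_well), final_substrate_dilution)
```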

TABLE 6

                          First sample                           Second sample
Plasmid          Clone    Readings (CPS)     Average  Average    Readings (CPS)     Average  Average
                 number
pPRB1P+luc       7        35890    35690     35790    34898      20322    20975     20648    19867
                 8        25498    25276     25387    24495      52997    51778     52388    51607
                 9        24137    27797     25967    25075      49192    46971     48081    47300
pYLR110P+luc     8        52354    53618     52986    52094      41789    38904     40346    39565
                 9        105299   99776     102537   101645     85562    84468     85015    84234
                 10       107531   109226    108379   107486     22507    22436     22471    21690
pYMR251AP+luc    8        71993    69797     70895    70003      40869    40202     40536    39755
                 9        98853    98389     98621    97729      51159    49828     50493    49712
                 10       83210    87546     85378    84485      70091    74576     72334    71553
pYMR107P+luc     1        9046     8650      8848     6790       29413    28505     28959    28124
                 2        3996     4009      4002     1945       24391    23915     24153    23318
                 3        3018     3236      3127     1069       23866    23408     23637    22802
pZEO1P+luc       1        64137    63162     63649    61592      47469    45769     46619    45784
                 2        19579    18329     18954    16897      44910    42982     43946    43111
                 3        87572    90317     88944    86887      142414   142262    142338   141503

TABLE 7

Promoter     mRNA levels                                Luciferase Expression   Luciferase Expression
                                                        Glucose                 Ethanol
PRB1         Ethanol Induced                            1.00                    1.00
YLR110C      Highly Ethanol and Glucose Induced         3.03                    1.22
YMR251WA     Highly Ethanol and Glucose Induced         2.92                    1.35
YMR107W      Ethanol Induced                            0.21                    0.95
ZEO1         Very Highly Ethanol and Glucose Induced    3.62                    2.89
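The luciferase expression columns of Table 7 appear to be readings expressed relative to the PRB1 control. The sketch below rests on an assumption that the text does not state: that each entry is the mean of the three per-clone averages in Table 6 (first sample for the glucose column, second sample for the ethanol column) divided by the corresponding PRB1 mean. Under that assumption the YLR110C and YMR251WA entries of Table 7 are reproduced; the calculation is not attempted here for YMR107W and ZEO1. The function name fold_vs_control is hypothetical.

```python
from statistics import mean

# Per-clone averages (CPS) from Table 6, first average column of each sample.
first_sample = {
    "PRB1": [35790, 25387, 25967],
    "YLR110C": [52986, 102537, 108379],
    "YMR251WA": [70895, 98621, 85378],
}
second_sample = {
    "PRB1": [20648, 52388, 48081],
    "YLR110C": [40346, 85015, 22471],
    "YMR251WA": [40536, 50493, 72334],
}


def fold_vs_control(averages, control="PRB1"):
    """Mean reading of each promoter divided by the mean reading of the control."""
    control_mean = mean(averages[control])
    return {name: mean(values) / control_mean for name, values in averages.items()}


glucose_fold = fold_vs_control(first_sample)   # assumed to map to the glucose column
ethanol_fold = fold_vs_control(second_sample)  # assumed to map to the ethanol column

for name in first_sample:
    print(name, round(glucose_fold[name], 2), round(ethanol_fold[name], 2))
```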

Three promoters give higher levels of expression than PRB1 at both ODs: YLR110C, YMR251WA, and ZEO1. The promoter showing the greatest fold induction is YMR107W.

Creating Vectors with Promoters but Without the Luciferase Gene

Based on the analysis of luciferase expression, four further promoter constructs have been made. They lack the luciferase gene and can be used to clone nucleic acid molecules encoding polypeptides of interest downstream of the promoters such that the promoters drive expression of the nucleic acid molecules of interest. These four plasmids are named: G1: pYLR110P (SEQ ID NO:23) (map at FIG. 9); G2: pYMR251AP (SEQ ID NO:24) (map at FIG. 10); G3: pYMR107P (SEQ ID NO:25) (map at FIG. 11); and G4: pZEO1P (SEQ ID NO:26) (map at FIG. 12). These were constructed by digesting pPRB1P (SEQ ID NO:27) with HindIII and NdeI to obtain the vector. The promoter+luc construct was digested with HindIII and NdeI to obtain the promoter fragment. The vector and promoter DNA were purified from LMP agarose using PCR preps. The vector and promoter were ligated and used to transform E. coli. Correct recombinants were screened for.

EXAMPLE 5

Isolation of Active Promoter Fragments

Operative fragments of the YLR110C, YMR251WA, YMR107W and ZEO1 promoters can be generated using restriction endonucleases, 5′ or 3′ deletion mutagenesis, PCR, site specific deletion, or a combination thereof. For example, purified pYLR110P+luc, pYMR251AP+luc, pYMR107P+luc or pZEO1P+luc plasmids, as generated in Example 3, can be subjected to restriction endonucleases to generate fragments of the YLR110C, YMR251WA, YMR107W or ZEO1 promoters. Restriction endonuclease sites, preferably unique restriction endonuclease sites, within the promoter sequences shown in SEQ ID NO:1, SEQ ID NO:2, SEQ ID NO:3, and SEQ ID NO:4 can be identified that generate fragments of the promoter upon restriction endonuclease digestion. Such fragments are preferably 17, 25, 50, 75, 100, 150, 200, 250, 300, 350, 400, 450, 500, 550, 600, 650 or 700 nucleotides in length.

The fragments generated by restriction endonuclease digestion of the promoters shown in SEQ ID NO:1, SEQ ID NO:2, SEQ ID NO:3, or SEQ ID NO:4 can be separated by agarose gel electrophoresis. The agarose gel band corresponding to the desired promoter fragment can be cut out of the agarose gel. The fragment can be isolated and purified from the agarose gel by, for example, electroelution or kits such as the QIAquick™ gel extraction kit or QIAEX® II Gel Extraction System (Qiagen Cat. No. 28704 and 20021).

The purified promoter fragment can be ligated into the isolated and purified HindIII/NdeI double-digested pPRB1P+luc backbone such that the promoter fragment is operably linked to a luciferase gene and transformed into E. coli, as described in Example 3. The new expression vector comprising a fragment of the YLR110C, YMR251WA, YMR107W, or ZEO1 promoter region can be isolated and purified from E. coli, sequenced, and transformed into yeast as described in Example 3.

To analyze promoter activity, luciferase assays as described in Example 4 can be conducted using S. cerevisiae cultures that have been transformed with the expression vector comprising a fragment of the YLR110C, YMR251WA, YMR107W, or ZEO1 promoter operably linked to a luciferase gene and S. cerevisiae cultures that have been transformed with pPRB1P+luc. The S. cerevisiae cultures are grown in medium containing a non-fermentable carbon source, such as ethanol, or a fermentable carbon source, such as glucose, or both. Cells are obtained from the cultures and analyzed for luciferase activity as described in Example 4.

A promoter fragment is operative if it expresses at least 75% of the luciferase activity of the PRB1 promoter. Preferably, an operative promoter fragment expresses at least 100%, 200%, 300%, 400%, or more of the luciferase activity of the PRB1 promoter.
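The 75% criterion can be written as a simple comparison of luminescence readings. The following is a minimal sketch; the function names and the example readings are hypothetical and are not data from the examples above.

```python
def relative_activity(fragment_cps, prb1_cps):
    """Luciferase activity of a promoter fragment as a fraction of the PRB1 control."""
    if prb1_cps <= 0:
        raise ValueError("PRB1 control reading must be positive")
    return fragment_cps / prb1_cps


def is_operative(fragment_cps, prb1_cps, threshold=0.75):
    """True if the fragment reaches at least the threshold fraction of PRB1 activity."""
    return relative_activity(fragment_cps, prb1_cps) >= threshold


# Hypothetical readings in counts per second, for illustration only.
prb1_reading = 40000
print(is_operative(30000, prb1_reading))  # exactly 75% of the control
print(is_operative(20000, prb1_reading))  # 50% of the control
print(is_operative(120000, prb1_reading, threshold=3.0))  # the preferred 300% level
```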

Brief Description of the Sequences

SEQ ID NO:1 Polynucleotide sequence of promoter YLR110C

SEQ ID NO:2 Polynucleotide sequence of promoter YMR251WA

SEQ ID NO:3 Polynucleotide sequence of promoter YMR107W

SEQ ID NO:4 Polynucleotide sequence of promoter ZEO1

SEQ ID NO:5 Forward PCR primer for YLR110C

SEQ ID NO:6 Reverse PCR primer for YLR110C

SEQ ID NO:7 Forward PCR primer for YMR251WA

SEQ ID NO:8 Reverse PCR primer for YMR251WA

SEQ ID NO:9 Forward PCR primer for YMR107W

SEQ ID NO:10 Reverse PCR primer for YMR107W

SEQ ID NO:11 Forward PCR primer for ZEO1

SEQ ID NO:12 Reverse PCR primer for ZEO1

SEQ ID NO:13 Yep13 Forward PCR primer

SEQ ID NO:14 Luc R1 Forward PCR primer

SEQ ID NO:15 Primer used in cDNA synthesis

SEQ ID NO:16 Control oligonucleotide used in GeneChip Microarray assay

SEQ ID NO:17 Original pYMR107P+luc sequence

SEQ ID NO:18 Modified pYMR107P+luc sequence

SEQ ID NO:19 Nucleotide sequence of pYLR110P+luc

SEQ ID NO:20 Nucleotide sequence of pYMR251AP+luc

SEQ ID NO:21 Nucleotide sequence of pYMR107P+luc

SEQ ID NO:22 Nucleotide sequence of pZEO1P+luc

SEQ ID NO:23 Nucleotide sequence of pYLR110P

SEQ ID NO:24 Nucleotide sequence of pYMR251AP

SEQ ID NO:25 Nucleotide sequence of pYMR107P

SEQ ID NO:26 Nucleotide sequence of pZEO1P

SEQ ID NO:27 Nucleotide sequence of pPRB1P

SEQ ID NO:28 Nucleotide sequence of pPRB1P+luc

SEQ ID NO:29 YLR110C promoter region

SEQ ID NO:30 YMR251WA promoter region

SEQ ID NO:31 YMR107W promoter region

SEQ ID NO:32 ZEO1 promoter region

32 1 494 DNA Saccharomyces cerevisiae 1 cgtctgattt ccgttttggg aatcctttgccgcgcgcccc tctcaaaact ccgcacaagt 60 cccagaaagc gggaaagaaa taaaacgccaccaaaaaaaa aaaaataaaa gccaatcctc 120 gaagcgtggg tggtaggccc tggattatcccgtacaagta tttctcagga gtaaaaaaac 180 cgtttgtttt ggaattcccc atttcgcggccacctacgcc gctatctttg caacaactat 240 ctgcgataac tcagcaaatt ttgcatattcgtgttgcagt attgcgataa tgggagtctt 300 actcccaaca taacggcaga aagaaatgtgagaaaatttt gcatcctttg cctccgttca 360 agtatataaa gtcggcatgc ttgataatctttctttccat cctacattgt tctaattatt 420 cttattctcc tttattcttt cctaacataccaagaaatta atcttctgtc attcgcttaa 480 acactatatc acat 494 2 723 DNASaccharomyces cerevisiae 2 ctttcgatta gcacgcacac acatcacata gactgcgtcataaaaataca ctacggaaaa 60 accataaaga gcaaagcgat acctacttgg aaggaaaaggagcacgcttg taagggggat 120 gggggctaag aagtcattca ctttcttttc ccttcgcggtccggacccgg gacccctcct 180 ctccccgcac gatttcttcc tttcatatct tccttttattcctatcccgt tgaagcaacc 240 gcactatgac taaatggtgc tggacatctc catggctgtgacttgtgtgt atctcacagt 300 ggtaacggca ccgtggctcg gaaacggttc cttcgtgacaattctagaac aggggctaca 360 gtctcgataa tagaataata agcgcatttt tgctagcgccgccgcggcgc ccgtttccca 420 atagggaggc gcagtttatc ggcggagctc tacttcttcctatttgggta agcccctttc 480 tgttttcggc cagtggttgc tgcaggctgc gccggagaacatagtgataa gggatgtaac 540 tttcgatgag agaattagca agcggaaaaa aactatggctagctgggagt tgtttttcaa 600 tcatataaaa gggagaaatt gttgctcact atgtgacagtttctgggacg tcttaacttt 660 tattgcagag gactatcaaa tcatacagat attgtcaaaaaaaaaaaaga ctaataataa 720 cat 723 3 497 DNA Saccharomyces cerevisiae 3gcagaaatga tgaagggtgt tagcgccgtc cactgatgtg cctggtagtc atgatttacg 60tataactaac acatcatgag gacggcggcg tcaccccaac gcaaaagagt gacttccctg 120cgctttgcca aaaccccata catcgccatc tggctcctgg cagggcggtt gatggacatc 180agccgcctcc cttaattgct aaagcctcca caaggcacaa ttaagcaata tttcgggaaa 240gtacaccagt cagtttgcgc ttttatgact gggttctaag gtactagatg tgaagtagtg 300gtgacagaat cagggagata agagggagca gggtggggta atgatgtgcg ataacaatct 360tgcttggcta atcaccccca tatcttgtag tgagtatata aataggagcc tcccttccta 
420ttgcaactcc ataaaatttt tttttgtagc cacttctgta acaagataaa taaaaccaac 480taatcgagat atcacat 497 4 500 DNA Saccharomyces cerevisiae 4 ggaggtctgcttcacgagcg cggtgtgcgc ctagtattgc cccgacggtc cgggtgccta 60 tccctagatttcgtcgtgcc ccgacccaaa tagttaaacg tgtggtttat gggtgcacca 120 gggctttatcgtgttttata tcgatggcga tttgtgcctc cagtgtattt ttgtatatcc 180 aattaaggtttcttacctaa ttttattttt atcatcttta gttaatgctg gtttgctctg 240 tttctgctgctttctgtgcg gttctcctct tctcttgttt cttcgtgttg tcccccatcg 300 ccgatgggcttatatggcgt atatatatag agcgagtttt tacgtcgaag atcatctcag 360 tttgcttgatagcctttcta ctttattact ttcgttttta acctcattat actttagttt 420 tctttgatcggtttttttct ctgtatactt aaaagttcaa atcaaagaaa catacaaaac 480 tacgtttatatcaattacat 500 5 35 DNA Saccharomyces cerevisiae 5 atgcaagctt cgcggccgccgtctgatttc cgttt 35 6 30 DNA Saccharomyces cerevisiae 6 ccaggccgcatatgtcatat agtgtttaag 30 7 37 DNA Saccharomyces cerevisiae 7 agctaagcttcgcggccgcc tttcgattag cacgcac 37 8 27 DNA Saccharomyces cerevisiae 8agataccttc atatgttatt attagtc 27 9 35 DNA Saccharomyces cerevisiae 9agctaagctt cgcggccgcg cagaaatgat gaagg 35 10 29 DNA Saccharomycescerevisiae 10 atccatccca tatgtgatat ctcgattag 29 11 35 DNA Saccharomycescerevisiae 11 agctaagctt cgcggccgcg gaggtctgct tcacg 35 12 29 DNASaccharomyces cerevisiae 12 tacgatcgca tatgtaattg atataaacg 29 13 20 DNASaccharomyces cerevisiae 13 cctcaattgg attagtctca 20 14 20 DNASaccharomyces cerevisiae 14 cacctcgata tgtgcatctg 20 15 63 DNASaccharomyces cerevisiae 15 ggccagtgaa ttgtaatacg actcactata gggaggcggttttttttttt tttttttttt 60 ttt 63 16 20 DNA Saccharomyces cerevisiae 16gtcaagatgc taccgttcag 20 17 23 DNA Saccharomyces cerevisiae unsure(17)..(23) The symbol “n” at positions 17 to 23 represents anynucleotide. 17 aagcttcgcg gccgcgnnnn nnn 23 18 33 DNA Saccharomycescerevisiae unsure (27)..(33) The symbol “n” at positions 27 to 33represents any nucleotide. 
18 aagcttagct aagcttcgcg gccgcgnnnn nnn 33 1912844 DNA Saccharomyces cerevisiae 19 aagcttcgcg gccgccgtct gatttccgttttgggaatcc tttgccgcgc gcccctctca 60 aaactccgca caagtcccag aaagcgggaaagaaataaaa cgccaccaaa aaaaaaaaaa 120 taaaagccaa tcctcgaagc gtgggtggtaggccctggat tatcccgtac aagtatttct 180 caggagtaaa aaaaccgttt gttttggaattccccatttc gcggccacct acgccgctat 240 ctttgcaaca actatctgcg ataactcagcaaattttgca tattcgtgtt gcagtattgc 300 gataatggga gtcttacttc caacataacggcagaaagaa atgtgagaaa attttgcatc 360 ctttgcctcc gttcaagtat ataaagtcggcatgcttgat aatctttctt tccatcctac 420 attgttctaa ttattcttat tctcctttattctttcctaa cataccaaga aattaatctt 480 ctgtcattcg cttaaacact atatcacatatggaagacgc caaaaacata aagaaaggcc 540 cggcgccatt ctatccgctg gaagatggaaccgctggaga gcaactgcat aaggctatga 600 agagatacgc cctggttcct ggaacaattgcttttacaga tgcacatatc gaggtggaca 660 tcacttacgc tgagtacttc gaaatgtccgttcggttggc agaagctatg aaacgatatg 720 ggctgaatac aaatcacaga atcgtcgtatgcagtgaaaa ctctcttcaa ttctttatgc 780 cggtgttggg cgcgttattt atcggagttgcagttgcgcc cgcgaacgac atttataatg 840 aacgtgaatt gctcaacagt atgggcatttcgcagcctac cgtggtgttc gtttccaaaa 900 aggggttgca aaaaattttg aacgtgcaaaaaaagctccc aatcatccaa aaaattatta 960 tcatggattc taaaacggat taccagggatttcagtcgat gtacacgttc gtcacatctc 1020 atctacctcc cggttttaat gaatacgattttgtgccaga gtccttcgat agggacaaga 1080 caattgcact gatcatgaac tcctctggatctactggtct gcctaaaggt gtcgctctgc 1140 ctcatagaac tgcctgcgtg agattctcgcatgccagaga tcctattttt ggcaatcaaa 1200 tcattccgga tactgcgatt ttaagtgttgttccattcca tcacggtttt ggaatgttta 1260 ctacactcgg atatttgata tgtggatttcgagtcgtctt aatgtataga tttgaagaag 1320 agctgtttct gaggagcctt caggattacaagattcaaag tgcgctgctg gtgccaaccc 1380 tattctcctt cttcgccaaa agcactctgattgacaaata cgatttatct aatttacacg 1440 aaattgcttc tggtggcgct cccctctctaaggaagtcgg ggaagcggtt gccaagaggt 1500 tccatctgcc aggtatcagg caaggatatgggctcactga gactacatca gctattctga 1560 ttacacccga gggggatgat aaaccgggcgcggtcggtaa agttgttcca ttttttgaag 1620 cgaaggttgt ggatctggat accgggaaaacgctgggcgt 
taatcaaaga ggcgaactgt 1680 gtgtgagagg tcctatgatt atgtccggttatgtaaacaa tccggaagcg accaacgcct 1740 tgattgacaa ggatggatgg ctacattctggagacatagc ttactgggac gaagacgaac 1800 acttcttcat cgttgaccgc ctgaagtctctgattaagta caaaggctat caggtggctc 1860 ccgctgaatt ggaatccatc ttgctccaacaccccaacat cttcgacgca ggtgtcgcag 1920 gtcttcccga cgatgacgcc ggtgaacttcccgccgccgt tgttgttttg gagcacggaa 1980 agacgatgac ggaaaaagag atcgtggattacgtcgccag tcaagtaaca accgcgaaaa 2040 agttgcgcgg aggagttgtg tttgtggacgaagtaccgaa aggtcttacc ggaaaactcg 2100 acgcaagaaa aatcagagag atcctcataaaggccaagaa gggcggaaag atcgccgtgt 2160 aattggatcc agtttaaaca gtagctttggacttcttcgc cagaggtttg gtcaagtctc 2220 caatcaaggt tgtcggcttg tctaccttgccagaaattta cgaaaagatg gaaaagggtc 2280 aaatcgttgg tagatacgtt gttgacacttctaaataagc gaatttctta tgatttatga 2340 tttttattat taaataagtt ataaaaaaaataagtgtata caaattttaa agtgactctt 2400 aggttttaaa acgaaaattc ttgttcttgagtaactcttt cctgtaggtc aggttgcttt 2460 ctcaggtata gcatgaggtc gctcttattgaccacacctc taccggcatg ccgagcaaat 2520 gcctgcaaat cgctccccat ttcacccaattgtagatatg ctaactccag caatgagttg 2580 atgaatctcg gtgtgtattt tatgtcctcagaagacaaca cctgttgtaa tcgttcttcc 2640 acacggatcg cggccgcttg atcctctacgccggacgcat cgtggccggc atcaccggcg 2700 ccacaggtgc ggttgctggc gcctatatcgccgacatcac cgatggggaa gatcgggctc 2760 gccacttcgg gctcatgagc gcttgtttcggcgtgggtat ggtggcaggc cccgtggccg 2820 ggggactgtt gggcgccatc tccttgcatgcaccattcct tgcggcggcg gtgctcaacg 2880 gcctcaacct actactgggc tgcttcctaatgcaggagtc gcataaggga gagcgtcgac 2940 cgatgccctt gagagccttc aacccagtcagctccttccg gtgggcgcgg ggcatgacta 3000 tcgtcgccgc acttatgact gtcttctttatcatgcaact cgtaggacag gtgccggcag 3060 cgctctgggt cattttcggc gaggaccgctttcgctggag cgcgacgatg atcggcctgt 3120 cgcttgcggt attcggaatc ttgcacgccctcgctcaagc cttcgtcact ggtcccgcca 3180 ccaaacgttt cggcgagaag caggccattatcgccggcat ggcggccgac gcgctgggct 3240 acgtcttgct ggcgttcgcg acgcgaggctggatggcctt ccccattatg attcttctcg 3300 cttccggcgg catcgggatg cccgcgttgcaggccatgct gtccaggcag gtagatgacg 3360 accatcaggg 
acagcttcaa ggatcgctcgcggctcttac cagcctaact tcgatcactg 3420 gaccgctgat cgtcacggcg atttatgccgcctcggcgag cacatggaac gggttggcat 3480 ggattgtagg cgccgcccta taccttgtctgcctccccgc gttgcgtcgc ggtgcatgga 3540 gccgggccac ctcgacctga atggaagccggcggcacctc gctaacggat tcaccactcc 3600 aagaattgga gccaatcaat tcttgcggagaactgtgaat gcgcaaacca acccttggca 3660 gaacatatcc atcgcgtccg ccatctccagcagccgcacg cggcgcatct cgggcagcgt 3720 tgggtcctgg ccacgggtgc gcatgatcgtgctcctgtcg ttgaggaccc ggctaggctg 3780 gcggggttgc cttactggtt agcagaatgaatcaccgata cgcgagcgaa cgtgaagcga 3840 ctgctgctgc aaaacgtctg cgacctgagcaacaacatga atggtcttcg gtttccgtgt 3900 ttcgtaaagt ctggaaacgc ggaagtcagcgccctgcacc attatgttcc ggatctgcat 3960 cgcaggatgc tgctggctac cctgtggaacacctacatct gtattaacga agcgctggca 4020 ttgaccctga gtgatttttc tctggtcccgccgcatccat accgccagtt gtttaccctc 4080 acaacgttcc agtaaccggg catgttcatcatcagtaacc cgtatcgtga gcatcctctc 4140 tcgtttcatc ggtatcatta cccccatgaacagaaattcc cccttacacg gaggcatcaa 4200 gtgaccaaac aggaaaaaac cgcccttaacatggcccgct ttatcagaag ccagacatta 4260 acgcttctgg agaaactcaa cgagctggacgcggatgaac aggcagacat ctgtgaatcg 4320 cttcacgacc acgctgatga gctttaccgcagctgcctcg cgcgtttcgg tgatgacggt 4380 gaaaacctct gacacatgca gctcccggagacggtcacag cttgtctgta agcggatgcc 4440 gggagcagac aagcccgtca gggcgcgtcagcgggtgttg gcgggtgtcg gggcgcagcc 4500 atgacccagt cacgtagcga tagcggagtgtatactggct taactatgcg gcatcagagc 4560 agattgtact gagagtgcac gatatccggtgtgaaatacc gcacagatgc gtaaggagaa 4620 aataccgcat caggcgctct tccgcttcctcgctcactga ctcgctgcgc tcggtcgttc 4680 ggctgcggcg agcggtatca gctcactcaaaggcggtaat acggttatcc acagaatcag 4740 gggataacgc aggaaagaac atgtgagcaaaaggccagca aaaggccagg aaccgtaaaa 4800 aggccgcgtt gctggcgttt ttccataggctccgcccccc tgacgagcat cacaaaaatc 4860 gacgctcaag tcagaggtgg cgaaacccgacaggactata aagataccag gcgtttcccc 4920 ctggaagctc cctcgtgcgc tctcctgttccgaccctgcc gcttaccgga tacctgtccg 4980 cctttctccc ttcgggaagc gtggcgctttctcaatgctc acgctgtagg tatctcagtt 5040 cggtgtaggt cgttcgctcc aagctgggctgtgtgcacga 
accccccgtt cagcccgacc 5100 gctgcgcctt atccggtaac tatcgtcttgagtccaaccc ggtaagacac gacttatcgc 5160 cactggcagc agccactggt aacaggattagcagagcgag gtatgtaggc ggtgctacag 5220 agttcttgaa gtggtggcct aactacggctacactagaag gacagtattt ggtatctgcg 5280 ctctgctgaa gccagttacc ttcggaaaaagagttggtag ctcttgatcc ggcaaacaaa 5340 ccaccgctgg tagcggtggt ttttttgtttgcaagcagca gattacgcgc agaaaaaaag 5400 gatctcaaga agatcctttg atcttttctacggggtctga cgctcagtgg aacgaaaact 5460 cacgttaagg gattttggtc atgagattatcaaaaaggat cttcacctag atccttttaa 5520 attaaaaatg aagttttaaa tcaatctaaagtatatatga gtaaacttgg tctgacagtt 5580 accaatgctt aatcagtgag gcacctatctcagcgatctg tctatttcgt tcatccatag 5640 ttgcctgact ccccgtcgtg tagataactacgatacggga gggcttacca tctggcccca 5700 gtgctgcaat gataccgcga gacccacgctcaccggctcc agatttatca gcaataaacc 5760 agccagccgg aagggccgag cgcagaagtggtcctgcaac tttatccgcc tccatccagt 5820 ctattaattg ttgccgggaa gctagagtaagtagttcgcc agttaatagt ttgcgcaacg 5880 ttgttgccat tgctgcaggc atcgtggtgtcacgctcgtc gtttggtatg gcttcattca 5940 gctccggttc ccaacgatca aggcgagttacatgatcccc catgttgtgc aaaaaagcgg 6000 ttagctcctt cggtcctccg atcgttgtcagaagtaagtt ggccgcagtg ttatcactca 6060 tggttatggc agcactgcat aattctcttactgtcatgcc atccgtaaga tgcttttctg 6120 tgactggtga gtactcaacc aagtcattctgagaatagtg tatgcggcga ccgagttgct 6180 cttgcccggc gtcaacacgg gataataccgcgccacatag cagaacttta aaagtgctca 6240 tcattggaaa acgttcttcg gggcgaaaactctcaaggat cttaccgctg ttgagatcca 6300 gttcgatgta acccactcgt gcacccaactgatcttcagc atcttttact ttcaccagcg 6360 tttctgggtg agcaaaaaca ggaaggcaaaatgccgcaaa aaagggaata agggcgacac 6420 ggaaatgttg aatactcata ctcttcctttttcaatatta ttgaagcatt tatcagggtt 6480 attgtctcat gagcggatac atatttgaatgtatttagaa aaataaacaa ataggggttc 6540 cgcgcacatt tccccgaaaa gtgccacctgacgtctaaga aaccattatt atcatgacat 6600 taacctataa aaataggcgt atcacgaggccctttcgtct tcaagaattc cacggactat 6660 agactatact agtatactcc gtctactgtacgatacactt ccgctcaggt ccttgtcctt 6720 taacgaggcc ttaccactct tttgttactctattgatcca gctcagcaaa ggcagtgtga 6780 tctaagattc 
tatcttcgcg atgtagtaaaactagctaga ccgagaaaga gactagaaat 6840 gcaaaaggca cttctacaat ggctgccatcattattatcc gatgtgacgc tgcagaagca 6900 gaaatacacg cggtcagtga agctattccgctattgaata acctcagtca ccttgtgcaa 6960 gaacttaaca agaaaccaat tattaaaggcttacttactg atagtagatc aacgatcagt 7020 ataattaagt ctacaaatga agagaaatttagaaacagat tttttggcac aaaggcaatg 7080 agacttagag atgaagtatc aggtaataatttatacgtat actacatcga gaccaagaag 7140 aacattgctg atgtgatgac aaaacctcttccgataaaaa catttaaact attaactaac 7200 aaatggattc attagatcta ttacattatgggtggtatgt tggaataaaa atcaactatc 7260 atctactaac tagtatttac gttactagtatattatcata tacggtgtta gaagatgacg 7320 caaatgatga gaaatagtca tctaaattagtggaagctga aacgcaagga ttgataatgt 7380 aataggatca atgaatatta acatataaaatgatgataat aatatttata gaattgtgta 7440 gaattgcaga ttccctttta tggattcctaaatcctcgag gagaacttct agtatatcta 7500 catacctaat attattgcct tattaaaaatggaatcccaa caattacatc aaaatccaca 7560 ttctcttcaa aatcaattgt cctgtacttccttgttcatg tgtgttcaaa aacgttatat 7620 ttataggata attatactct atttctcaacaagtaattgg ttgtttggcc gagcggtcta 7680 aggcgcctga ttcaagaaat atcttgaccgcagttaactg tgggaatact caggtatcgt 7740 aagatgcaag agttcgaatc tcttagcaaccattattttt ttcctcaaca taacgagaac 7800 acacaggggc gctatcgcac agaatcaaattcgatgactg gaaatttttt gttaatttca 7860 gaggtcgcct gacgcatata cctttttcaactgaaaaatt gggagaaaaa ggaaaggtga 7920 gagccgcgga accggctttt catatagaatagagaagcgt tcatgactaa atgcttgcat 7980 cacaatactt gaagttgaca atattatttaaggacctatt gttttttcca ataggtggtt 8040 agcaatcgtc ttactttcta acttttcttaccttttacat ttcagcaata tatatatata 8100 tatttcaagg atataccatt ctaatgtctgcccctaagaa gatcgtcgtt ttgccaggtg 8160 accacgttgg tcaagaaatc acagccgaagccattaaggt tcttaaagct atttctgatg 8220 ttcgttccaa tgtcaagttc gatttcgaaaatcatttaat tggtggtgct gctatcgatg 8280 ctacaggtgt cccacttcca gatgaggcgctggaagcctc caagaaggtt gatgccgttt 8340 tgttaggtgc tgtgggtggt cctaaatggggtaccggtag tgttagacct gaacaaggtt 8400 tactaaaaat ccgtaaagaa cttcaattgtacgccaactt aagaccatgt aactttgcat 8460 ccgactctct tttagactta tctccaatcaagccacaatt 
tgctaaaggt actgacttcg 8520 ttgttgtcag agaattagtg ggaggtatttactttggtaa gagaaaggaa gacgatggtg 8580 atggtgtcgc ttgggatagt gaacaatacaccgttccaga agtgcaaaga atcacaagaa 8640 tggccgcttt catggcccta caacatgagccaccattgcc tatttggtcc ttggataaag 8700 ctaatgtttt ggcctcttca agattatggagaaaaactgt ggaggaaacc atcaagaacg 8760 aattccctac attgaaggtt caacatcaattgattgattc tgccgccatg atcctagtta 8820 agaacccaac ccacctaaat ggtattataatcaccagcaa catgtttggt gatatcatct 8880 ccgatgaagc ctccgttatc ccaggttccttgggtttgtt gccatctgcg tccttggcct 8940 ctttgccaga caagaacacc gcatttggtttgtacgaacc atgccacggt tctgctccag 9000 atttgccaaa gaataaggtc aaccctatcgccactatctt gtctgctgca atgatgttga 9060 aattgtcatt gaacttgcct gaagaaggtaaggccattga agatgcagtt aaaaaggttt 9120 tggatgcagg tatcagaact ggtgatttaggtggttccaa cagtaccacg gaagtcggtg 9180 atgctgtcgc cgaagaagtt aagaaaatccttgcttaaaa agattctctt tttttatgat 9240 atttgtacat aaactttata aatgaaattcataatagaaa cgacacgaaa ttacaaaatg 9300 gaatatgttc atagggtaga cgaaactatatacgcaatct acatacattt atcaagaagg 9360 agaaaaagga ggatgtaaag gaatacaggtaagcaaattg atactaatgg ctcaacgtga 9420 taaggaaaaa gaattgcact ttaacattaatattgacaag gaggagggca ccacacaaaa 9480 agttaggtgt aacagaaaat catgaaactatgattcctaa tttatatatt ggaggatttt 9540 ctctaaaaaa aaaaaaatac aacaaataaaaaacactcaa tgacctgacc atttgatgga 9600 gtttaagtca ataccttctt gaaccatttcccataatggt gaaagttccc tcaagaattt 9660 tactctgtca gaaacggcct taacgacgtagtcgacctcc tcttcagtac taaatctacc 9720 aataccaaat ctgatggaag aatgggctaatgcatcatcc ttacccagcg catgtaaaac 9780 ataagaaggt tctagggaag cagatgtacaggctgaaccc gaggataatg cgatatccct 9840 tagtgccatc aataaagatt ctccttccacgtaggcgaaa gaaacgttaa cacaccctgg 9900 ataacgatga tctggagatc cgttcaacgtggtatgttca gcggataata gacctttgac 9960 taatttatcg gatagtcttt tgatgtgagcttggtcgttg tcaaattctt tcttcatcaa 10020 tctcgcagct tcaccaaatc ccgctaccaatgggggggcc aaagtaccag atctcaatcc 10080 tctctcttgg ccaccaccgg atagtaaaggttctaatcta actcttggtc tccttcttac 10140 atagatggca cctattccct ttggaccgtaaatcttgtga gaagaaattg atagtaaatc 10200 
aatgttcatt tcattgacat caatgtgaatcttaccatag gcttgtgcgg cgtcagtatg 10260 aaagtagatc ttattctttc tacaaattgcaccaatttct ttaataggtt gaatgacacc 10320 gatttcatta ttgacagcca tcacagagacgagacaggta tctggtctaa tggcatcttc 10380 caattccttc aaatcgataa gaccttgatcgtccacattt aggaaagtga cttcaaatcc 10440 ctccttcatc atggcccgtg cggcttccaagacacacttg tgttccgttc tagtggtgat 10500 gatgtgtttc ttagtcttct tataaaatcttgggacaccc ttaagaacca tattattaga 10560 ttcggtcgct cccgaagtga atattatttccttggggtcg gcattgatca tctttgctac 10620 gtaagctcta gcattttcca cagcagtatttgtttcccaa ccgtaagagt gagtgttgga 10680 atgaggatta ccataaagtc ccgtataaaacttcaacatc gtatccaaaa ccctagggtc 10740 tgttggtgta gtggcttgca tgtcaagatatatgggacga gtaccaaaac ctgtgttttc 10800 ttgataagca tggctcattg cagtgctaccagaagctact acagcatctg gggtggtacc 10860 ggatgcactc gcacgggcac tagcctgtgcctttgcagca gcctgaatat cggtatgcgt 10920 ttccagagag aagttgtcgt ctaacttcacgcctgctgca gtctcaatga tattcgaata 10980 cgctttgagg agatacagcc taatatccgacaaactgttt tacagattta cgatcgtact 11040 tgttacccat cattgaattt tgaacatccgaacctgggag ttttccctga aacagatagt 11100 atatttgaac ctgtataata atatatagtctagcgcttta cggaagacaa tgtatgtatt 11160 tcggttcctg gagaaactat tgcatctattgcataggtaa tcttgcacgt cgcatccccg 11220 gttcattttc tgcgtttcca tcttgcacttcaatagcata tctttgttaa cgaagcatct 11280 gtgcttcatt ttgtagaaca aaaatgcaacgcgagagcgc taatttttca aacaaagaat 11340 ctgagctgca tttttacaga acagaaatgcaacgcgaaag cgctatttta ccaacgaaga 11400 atctgtgctt catttttgta aaacaaaaatgcaacgcgag agcgctaatt tttcaaacaa 11460 agaatctgag ctgcattttt acagaacagaaatgcaacgc gagagcgcta ttttaccaac 11520 aaagaatcta tacttctttt ttgttctacaaaaatgcatc ccgagagcgc tatttttcta 11580 acaaagcatc ttagattact ttttttctcctttgtgcgct ctataatgca gtctcttgat 11640 aactttttgc actgtaggtc cgttaaggttagaagaaggc tactttggtg tctattttct 11700 cttccataaa aaaagcctga ctccacttcccgcgtttact gattactagc gaagctgcgg 11760 gtgcattttt tcaagataaa ggcatccccgattatattct ataccgatgt ggattgcgca 11820 tactttgtga acagaaagtg atagcgttgatgattcttca ttggtcagaa aattatgaac 11880 ggtttcttct 
attttgtctc tatatactacgtataggaaa tgtttacatt ttcgtattgt 11940 tttcgattca ctctatgaat agttcttactacaatttttt tgtctaaaga gtaatactag 12000 agataaacat aaaaaatgta gaggtcgagtttagatgcaa gttcaaggag cgaaaggtgg 12060 atgggtaggt tatataggga tatagcacagagatatatag caaagagata cttttgagca 12120 atgtttgtgg aagcggtatt cgcaatattttagtagctcg ttacagtccg gtgcgttttt 12180 ggttttttga aagtgcgtct tcagagcgcttttggttttc aaaagcgctc tgaagttcct 12240 atactttcta gagaatagga acttcggaataggaacttca aagcgtttcc gaaaacgagc 12300 gcttccgaaa atgcaacgcg agctgcgcacatacagctca ctgttcacgt cgcacctata 12360 tctgcgtgtt gcctgtatat atatatacatgagaagaacg gcatagtgcg tgtttatgct 12420 taaatgcgta cttatatgcg tctatttatgtaggatgaaa ggtagtctag tacctcctgt 12480 gatattatcc cattccatgc ggggtatcgtatgcttcctt cagcactacc ctttagctgt 12540 tctatatgct gccactcctc aattggattagtctcatcct tcaatgctat catttccttt 12600 gatattcgat cctaggcata gtaccgagaaactagtgcga agtagtgatc aggtattgct 12660 gttatctgat gagtatacgt tgtcctggccacggcagaag cacgcttatc gctccaattt 12720 cccacaacat tagtcaactc cgttaggcccttcattgaaa gaaatgaggt catcaaatgt 12780 cttccaatgt gagattttgg gccattttttatagcaaaga ttgaataagg cgcatttttc 12840 ttca 12844 20 13073 DNASaccharomyces cerevisiae 20 aagcttcgcg gccgcctttc gattagcacg cacacacatcacatagactg cgtcataaaa 60 atacactacg gaaaaaccat aaagagcaaa gcgatacctacttggaagga aaaggagcac 120 gcttgtaagg gggatggggg ctaagaagtc attcactttcttttcccttc gcggtccgga 180 cccgggaccc ctcctctccc cgcacgattt cttcctttcatatcttcctt ttattcctat 240 cccgttgaag caaccgcact atgactaaat ggtgctggacatctccatgg ctgtgacttg 300 tgtgtatctc acagtggtaa cggcaccgtg gctcggaaacggttccttcg tgacaattct 360 agaacagggg ctacagtctc gataatagaa taataagcgcatttttgcta gcgccgccgc 420 ggcgcccgtt tcccaatagg gaggcgcagt ttatcggcggagctctactt cttcctattt 480 gggtaagccc ctttctgttt tcggccagtg gttgctgcaggctgcgccgg agaacatagt 540 gataagggat gtaactttcg atgagagaat tagcaagcggaaaaaaacta tggctagctg 600 ggagttgttt ttcaatcata taaaagggag aaattgttgctcactatgtg acagtttctg 660 ggacgtctta acttttattg cagaggacta tcaaatcatacagatattgt caaaaaaaaa 
720 aaagactaat aataacatat ggaagacgcc aaaaacataaagaaaggccc ggcgccattc 780 tatccgctgg aagatggaac cgctggagag caactgcataaggctatgaa gagatacgcc 840 ctggttcctg gaacaattgc ttttacagat gcacatatcgaggtggacat cacttacgct 900 gagtacttcg aaatgtccgt tcggttggca gaagctatgaaacgatatgg gctgaataca 960 aatcacagaa tcgtcgtatg cagtgaaaac tctcttcaattctttatgcc ggtgttgggc 1020 gcgttattta tcggagttgc agttgcgccc gcgaacgacatttataatga acgtgaattg 1080 ctcaacagta tgggcatttc gcagcctacc gtggtgttcgtttccaaaaa ggggttgcaa 1140 aaaattttga acgtgcaaaa aaagctccca atcatccaaaaaattattat catggattct 1200 aaaacggatt accagggatt tcagtcgatg tacacgttcgtcacatctca tctacctccc 1260 ggttttaatg aatacgattt tgtgccagag tccttcgatagggacaagac aattgcactg 1320 atcatgaact cctctggatc tactggtctg cctaaaggtgtcgctctgcc tcatagaact 1380 gcctgcgtga gattctcgca tgccagagat cctatttttggcaatcaaat cattccggat 1440 actgcgattt taagtgttgt tccattccat cacggttttggaatgtttac tacactcgga 1500 tatttgatat gtggatttcg agtcgtctta atgtatagatttgaagaaga gctgtttctg 1560 aggagccttc aggattacaa gattcaaagt gcgctgctggtgccaaccct attctccttc 1620 ttcgccaaaa gcactctgat tgacaaatac gatttatctaatttacacga aattgcttct 1680 ggtggcgctc ccctctctaa ggaagtcggg gaagcggttgccaagaggtt ccatctgcca 1740 ggtatcaggc aaggatatgg gctcactgag actacatcagctattctgat tacacccgag 1800 ggggatgata aaccgggcgc ggtcggtaaa gttgttccattttttgaagc gaaggttgtg 1860 gatctggata ccgggaaaac gctgggcgtt aatcaaagaggcgaactgtg tgtgagaggt 1920 cctatgatta tgtccggtta tgtaaacaat ccggaagcgaccaacgcctt gattgacaag 1980 gatggatggc tacattctgg agacatagct tactgggacgaagacgaaca cttcttcatc 2040 gttgaccgcc tgaagtctct gattaagtac aaaggctatcaggtggctcc cgctgaattg 2100 gaatccatct tgctccaaca ccccaacatc ttcgacgcaggtgtcgcagg tcttcccgac 2160 gatgacgccg gtgaacttcc cgccgccgtt gttgttttggagcacggaaa gacgatgacg 2220 gaaaaagaga tcgtggatta cgtcgccagt caagtaacaaccgcgaaaaa gttgcgcgga 2280 ggagttgtgt ttgtggacga agtaccgaaa ggtcttaccggaaaactcga cgcaagaaaa 2340 atcagagaga tcctcataaa ggccaagaag ggcggaaagatcgccgtgta attggatcca 2400 gtttaaacag tagctttgga cttcttcgcc 
agaggtttggtcaagtctcc aatcaaggtt 2460 gtcggcttgt ctaccttgcc agaaatttac gaaaagatggaaaagggtca aatcgttggt 2520 agatacgttg ttgacacttc taaataagcg aatttcttatgatttatgat ttttattatt 2580 aaataagtta taaaaaaaat aagtgtatac aaattttaaagtgactctta ggttttaaaa 2640 cgaaaattct tgttcttgag taactctttc ctgtaggtcaggttgctttc tcaggtatag 2700 catgaggtcg ctcttattga ccacacctct accggcatgccgagcaaatg cctgcaaatc 2760 gctccccatt tcacccaatt gtagatatgc taactccagcaatgagttga tgaatctcgg 2820 tgtgtatttt atgtcctcag aagacaacac ctgttgtaatcgttcttcca cacggatcgc 2880 ggccgcttga tcctctacgc cggacgcatc gtggccggcatcaccggcgc cacaggtgcg 2940 gttgctggcg cctatatcgc cgacatcacc gatggggaagatcgggctcg ccacttcggg 3000 ctcatgagcg cttgtttcgg cgtgggtatg gtggcaggccccgtggccgg gggactgttg 3060 ggcgccatct ccttgcatgc accattcctt gcggcggcggtgctcaacgg cctcaaccta 3120 ctactgggct gcttcctaat gcaggagtcg cataagggagagcgtcgacc gatgcccttg 3180 agagccttca acccagtcag ctccttccgg tgggcgcggggcatgactat cgtcgccgca 3240 cttatgactg tcttctttat catgcaactc gtaggacaggtgccggcagc gctctgggtc 3300 attttcggcg aggaccgctt tcgctggagc gcgacgatgatcggcctgtc gcttgcggta 3360 ttcggaatct tgcacgccct cgctcaagcc ttcgtcactggtcccgccac caaacgtttc 3420 ggcgagaagc aggccattat cgccggcatg gcggccgacgcgctgggcta cgtcttgctg 3480 gcgttcgcga cgcgaggctg gatggccttc cccattatgattcttctcgc ttccggcggc 3540 atcgggatgc ccgcgttgca ggccatgctg tccaggcaggtagatgacga ccatcaggga 3600 cagcttcaag gatcgctcgc ggctcttacc agcctaacttcgatcactgg accgctgatc 3660 gtcacggcga tttatgccgc ctcggcgagc acatggaacgggttggcatg gattgtaggc 3720 gccgccctat accttgtctg cctccccgcg ttgcgtcgcggtgcatggag ccgggccacc 3780 tcgacctgaa tggaagccgg cggcacctcg ctaacggattcaccactcca agaattggag 3840 ccaatcaatt cttgcggaga actgtgaatg cgcaaaccaacccttggcag aacatatcca 3900 tcgcgtccgc catctccagc agccgcacgc ggcgcatctcgggcagcgtt gggtcctggc 3960 cacgggtgcg catgatcgtg ctcctgtcgt tgaggacccggctaggctgg cggggttgcc 4020 ttactggtta gcagaatgaa tcaccgatac gcgagcgaacgtgaagcgac tgctgctgca 4080 aaacgtctgc gacctgagca acaacatgaa tggtcttcggtttccgtgtt tcgtaaagtc 4140 
tggaaacgcg gaagtcagcg ccctgcacca ttatgttccggatctgcatc gcaggatgct 4200 gctggctacc ctgtggaaca cctacatctg tattaacgaagcgctggcat tgaccctgag 4260 tgatttttct ctggtcccgc cgcatccata ccgccagttgtttaccctca caacgttcca 4320 gtaaccgggc atgttcatca tcagtaaccc gtatcgtgagcatcctctct cgtttcatcg 4380 gtatcattac ccccatgaac agaaattccc ccttacacggaggcatcaag tgaccaaaca 4440 ggaaaaaacc gcccttaaca tggcccgctt tatcagaagccagacattaa cgcttctgga 4500 gaaactcaac gagctggacg cggatgaaca ggcagacatctgtgaatcgc ttcacgacca 4560 cgctgatgag ctttaccgca gctgcctcgc gcgtttcggtgatgacggtg aaaacctctg 4620 acacatgcag ctcccggaga cggtcacagc ttgtctgtaagcggatgccg ggagcagaca 4680 agcccgtcag ggcgcgtcag cgggtgttgg cgggtgtcggggcgcagcca tgacccagtc 4740 acgtagcgat agcggagtgt atactggctt aactatgcggcatcagagca gattgtactg 4800 agagtgcacg atatccggtg tgaaataccg cacagatgcgtaaggagaaa ataccgcatc 4860 aggcgctctt ccgcttcctc gctcactgac tcgctgcgctcggtcgttcg gctgcggcga 4920 gcggtatcag ctcactcaaa ggcggtaata cggttatccacagaatcagg ggataacgca 4980 ggaaagaaca tgtgagcaaa aggccagcaa aaggccaggaaccgtaaaaa ggccgcgttg 5040 ctggcgtttt tccataggct ccgcccccct gacgagcatcacaaaaatcg acgctcaagt 5100 cagaggtggc gaaacccgac aggactataa agataccaggcgtttccccc tggaagctcc 5160 ctcgtgcgct ctcctgttcc gaccctgccg cttaccggatacctgtccgc ctttctccct 5220 tcgggaagcg tggcgctttc tcaatgctca cgctgtaggtatctcagttc ggtgtaggtc 5280 gttcgctcca agctgggctg tgtgcacgaa ccccccgttcagcccgaccg ctgcgcctta 5340 tccggtaact atcgtcttga gtccaacccg gtaagacacgacttatcgcc actggcagca 5400 gccactggta acaggattag cagagcgagg tatgtaggcggtgctacaga gttcttgaag 5460 tggtggccta actacggcta cactagaagg acagtatttggtatctgcgc tctgctgaag 5520 ccagttacct tcggaaaaag agttggtagc tcttgatccggcaaacaaac caccgctggt 5580 agcggtggtt tttttgtttg caagcagcag attacgcgcagaaaaaaagg atctcaagaa 5640 gatcctttga tcttttctac ggggtctgac gctcagtggaacgaaaactc acgttaaggg 5700 attttggtca tgagattatc aaaaaggatc ttcacctagatccttttaaa ttaaaaatga 5760 agttttaaat caatctaaag tatatatgag taaacttggtctgacagtta ccaatgctta 5820 atcagtgagg cacctatctc agcgatctgt 
ctatttcgttcatccatagt tgcctgactc 5880 cccgtcgtgt agataactac gatacgggag ggcttaccatctggccccag tgctgcaatg 5940 ataccgcgag acccacgctc accggctcca gatttatcagcaataaacca gccagccgga 6000 agggccgagc gcagaagtgg tcctgcaact ttatccgcctccatccagtc tattaattgt 6060 tgccgggaag ctagagtaag tagttcgcca gttaatagtttgcgcaacgt tgttgccatt 6120 gctgcaggca tcgtggtgtc acgctcgtcg tttggtatggcttcattcag ctccggttcc 6180 caacgatcaa ggcgagttac atgatccccc atgttgtgcaaaaaagcggt tagctccttc 6240 ggtcctccga tcgttgtcag aagtaagttg gccgcagtgttatcactcat ggttatggca 6300 gcactgcata attctcttac tgtcatgcca tccgtaagatgcttttctgt gactggtgag 6360 tactcaacca agtcattctg agaatagtgt atgcggcgaccgagttgctc ttgcccggcg 6420 tcaacacggg ataataccgc gccacatagc agaactttaaaagtgctcat cattggaaaa 6480 cgttcttcgg ggcgaaaact ctcaaggatc ttaccgctgttgagatccag ttcgatgtaa 6540 cccactcgtg cacccaactg atcttcagca tcttttactttcaccagcgt ttctgggtga 6600 gcaaaaacag gaaggcaaaa tgccgcaaaa aagggaataagggcgacacg gaaatgttga 6660 atactcatac tcttcctttt tcaatattat tgaagcatttatcagggtta ttgtctcatg 6720 agcggataca tatttgaatg tatttagaaa aataaacaaataggggttcc gcgcacattt 6780 ccccgaaaag tgccacctga cgtctaagaa accattattatcatgacatt aacctataaa 6840 aataggcgta tcacgaggcc ctttcgtctt caagaattccacggactata gactatacta 6900 gtatactccg tctactgtac gatacacttc cgctcaggtccttgtccttt aacgaggcct 6960 taccactctt ttgttactct attgatccag ctcagcaaaggcagtgtgat ctaagattct 7020 atcttcgcga tgtagtaaaa ctagctagac cgagaaagagactagaaatg caaaaggcac 7080 ttctacaatg gctgccatca ttattatccg atgtgacgctgcagaagcag aaatacacgc 7140 ggtcagtgaa gctattccgc tattgaataa cctcagtcaccttgtgcaag aacttaacaa 7200 gaaaccaatt attaaaggct tacttactga tagtagatcaacgatcagta taattaagtc 7260 tacaaatgaa gagaaattta gaaacagatt ttttggcacaaaggcaatga gacttagaga 7320 tgaagtatca ggtaataatt tatacgtata ctacatcgagaccaagaaga acattgctga 7380 tgtgatgaca aaacctcttc cgataaaaac atttaaactattaactaaca aatggattca 7440 ttagatctat tacattatgg gtggtatgtt ggaataaaaatcaactatca tctactaact 7500 agtatttacg ttactagtat attatcatat acggtgttagaagatgacgc aaatgatgag 7560 
aaatagtcat ctaaattagt ggaagctgaa acgcaaggattgataatgta ataggatcaa 7620 tgaatattaa catataaaat gatgataata atatttatagaattgtgtag aattgcagat 7680 tcccttttat ggattcctaa atcctcgagg agaacttctagtatatctac atacctaata 7740 ttattgcctt attaaaaatg gaatcccaac aattacatcaaaatccacat tctcttcaaa 7800 atcaattgtc ctgtacttcc ttgttcatgt gtgttcaaaaacgttatatt tataggataa 7860 ttatactcta tttctcaaca agtaattggt tgtttggccgagcggtctaa ggcgcctgat 7920 tcaagaaata tcttgaccgc agttaactgt gggaatactcaggtatcgta agatgcaaga 7980 gttcgaatct cttagcaacc attatttttt tcctcaacataacgagaaca cacaggggcg 8040 ctatcgcaca gaatcaaatt cgatgactgg aaattttttgttaatttcag aggtcgcctg 8100 acgcatatac ctttttcaac tgaaaaattg ggagaaaaaggaaaggtgag agccgcggaa 8160 ccggcttttc atatagaata gagaagcgtt catgactaaatgcttgcatc acaatacttg 8220 aagttgacaa tattatttaa ggacctattg ttttttccaataggtggtta gcaatcgtct 8280 tactttctaa cttttcttac cttttacatt tcagcaatatatatatatat atttcaagga 8340 tataccattc taatgtctgc ccctaagaag atcgtcgttttgccaggtga ccacgttggt 8400 caagaaatca cagccgaagc cattaaggtt cttaaagctatttctgatgt tcgttccaat 8460 gtcaagttcg atttcgaaaa tcatttaatt ggtggtgctgctatcgatgc tacaggtgtc 8520 ccacttccag atgaggcgct ggaagcctcc aagaaggttgatgccgtttt gttaggtgct 8580 gtgggtggtc ctaaatgggg taccggtagt gttagacctgaacaaggttt actaaaaatc 8640 cgtaaagaac ttcaattgta cgccaactta agaccatgtaactttgcatc cgactctctt 8700 ttagacttat ctccaatcaa gccacaattt gctaaaggtactgacttcgt tgttgtcaga 8760 gaattagtgg gaggtattta ctttggtaag agaaaggaagacgatggtga tggtgtcgct 8820 tgggatagtg aacaatacac cgttccagaa gtgcaaagaatcacaagaat ggccgctttc 8880 atggccctac aacatgagcc accattgcct atttggtccttggataaagc taatgttttg 8940 gcctcttcaa gattatggag aaaaactgtg gaggaaaccatcaagaacga attccctaca 9000 ttgaaggttc aacatcaatt gattgattct gccgccatgatcctagttaa gaacccaacc 9060 cacctaaatg gtattataat caccagcaac atgtttggtgatatcatctc cgatgaagcc 9120 tccgttatcc caggttcctt gggtttgttg ccatctgcgtccttggcctc tttgccagac 9180 aagaacaccg catttggttt gtacgaacca tgccacggttctgctccaga tttgccaaag 9240 aataaggtca accctatcgc cactatcttg 
tctgctgcaatgatgttgaa attgtcattg 9300 aacttgcctg aagaaggtaa ggccattgaa gatgcagttaaaaaggtttt ggatgcaggt 9360 atcagaactg gtgatttagg tggttccaac agtaccacggaagtcggtga tgctgtcgcc 9420 gaagaagtta agaaaatcct tgcttaaaaa gattctctttttttatgata tttgtacata 9480 aactttataa atgaaattca taatagaaac gacacgaaattacaaaatgg aatatgttca 9540 tagggtagac gaaactatat acgcaatcta catacatttatcaagaagga gaaaaaggag 9600 gatgtaaagg aatacaggta agcaaattga tactaatggctcaacgtgat aaggaaaaag 9660 aattgcactt taacattaat attgacaagg aggagggcaccacacaaaaa gttaggtgta 9720 acagaaaatc atgaaactat gattcctaat ttatatattggaggattttc tctaaaaaaa 9780 aaaaaataca acaaataaaa aacactcaat gacctgaccatttgatggag tttaagtcaa 9840 taccttcttg aaccatttcc cataatggtg aaagttccctcaagaatttt actctgtcag 9900 aaacggcctt aacgacgtag tcgacctcct cttcagtactaaatctacca ataccaaatc 9960 tgatggaaga atgggctaat gcatcatcct tacccagcgcatgtaaaaca taagaaggtt 10020 ctagggaagc agatgtacag gctgaacccg aggataatgcgatatccctt agtgccatca 10080 ataaagattc tccttccacg taggcgaaag aaacgttaacacaccctgga taacgatgat 10140 ctggagatcc gttcaacgtg gtatgttcag cggataatagacctttgact aatttatcgg 10200 atagtctttt gatgtgagct tggtcgttgt caaattctttcttcatcaat ctcgcagctt 10260 caccaaatcc cgctaccaat gggggggcca aagtaccagatctcaatcct ctctcttggc 10320 caccaccgga tagtaaaggt tctaatctaa ctcttggtctccttcttaca tagatggcac 10380 ctattccctt tggaccgtaa atcttgtgag aagaaattgatagtaaatca atgttcattt 10440 cattgacatc aatgtgaatc ttaccatagg cttgtgcggcgtcagtatga aagtagatct 10500 tattctttct acaaattgca ccaatttctt taataggttgaatgacaccg atttcattat 10560 tgacagccat cacagagacg agacaggtat ctggtctaatggcatcttcc aattccttca 10620 aatcgataag accttgatcg tccacattta ggaaagtgacttcaaatccc tccttcatca 10680 tggcccgtgc ggcttccaag acacacttgt gttccgttctagtggtgatg atgtgtttct 10740 tagtcttctt ataaaatctt gggacaccct taagaaccatattattagat tcggtcgctc 10800 ccgaagtgaa tattatttcc ttggggtcgg cattgatcatctttgctacg taagctctag 10860 cattttccac agcagtattt gtttcccaac cgtaagagtgagtgttggaa tgaggattac 10920 cataaagtcc cgtataaaac ttcaacatcg tatccaaaaccctagggtct 
gttggtgtag 10980 tggcttgcat gtcaagatat atgggacgag taccaaaacctgtgttttct tgataagcat 11040 ggctcattgc agtgctacca gaagctacta cagcatctggggtggtaccg gatgcactcg 11100 cacgggcact agcctgtgcc tttgcagcag cctgaatatcggtatgcgtt tccagagaga 11160 agttgtcgtc taacttcacg cctgctgcag tctcaatgatattcgaatac gctttgagga 11220 gatacagcct aatatccgac aaactgtttt acagatttacgatcgtactt gttacccatc 11280 attgaatttt gaacatccga acctgggagt tttccctgaaacagatagta tatttgaacc 11340 tgtataataa tatatagtct agcgctttac ggaagacaatgtatgtattt cggttcctgg 11400 agaaactatt gcatctattg cataggtaat cttgcacgtcgcatccccgg ttcattttct 11460 gcgtttccat cttgcacttc aatagcatat ctttgttaacgaagcatctg tgcttcattt 11520 tgtagaacaa aaatgcaacg cgagagcgct aatttttcaaacaaagaatc tgagctgcat 11580 ttttacagaa cagaaatgca acgcgaaagc gctattttaccaacgaagaa tctgtgcttc 11640 atttttgtaa aacaaaaatg caacgcgaga gcgctaatttttcaaacaaa gaatctgagc 11700 tgcattttta cagaacagaa atgcaacgcg agagcgctattttaccaaca aagaatctat 11760 acttcttttt tgttctacaa aaatgcatcc cgagagcgctatttttctaa caaagcatct 11820 tagattactt tttttctcct ttgtgcgctc tataatgcagtctcttgata actttttgca 11880 ctgtaggtcc gttaaggtta gaagaaggct actttggtgtctattttctc ttccataaaa 11940 aaagcctgac tccacttccc gcgtttactg attactagcgaagctgcggg tgcatttttt 12000 caagataaag gcatccccga ttatattcta taccgatgtggattgcgcat actttgtgaa 12060 cagaaagtga tagcgttgat gattcttcat tggtcagaaaattatgaacg gtttcttcta 12120 ttttgtctct atatactacg tataggaaat gtttacattttcgtattgtt ttcgattcac 12180 tctatgaata gttcttacta caattttttt gtctaaagagtaatactaga gataaacata 12240 aaaaatgtag aggtcgagtt tagatgcaag ttcaaggagcgaaaggtgga tgggtaggtt 12300 atatagggat atagcacaga gatatatagc aaagagatacttttgagcaa tgtttgtgga 12360 agcggtattc gcaatatttt agtagctcgt tacagtccggtgcgtttttg gttttttgaa 12420 agtgcgtctt cagagcgctt ttggttttca aaagcgctctgaagttccta tactttctag 12480 agaataggaa cttcggaata ggaacttcaa agcgtttccgaaaacgagcg cttccgaaaa 12540 tgcaacgcga gctgcgcaca tacagctcac tgttcacgtcgcacctatat ctgcgtgttg 12600 cctgtatata tatatacatg agaagaacgg catagtgcgtgtttatgctt aaatgcgtac 
12660 ttatatgcgt ctatttatgt aggatgaaag gtagtctagtacctcctgtg atattatccc 12720 attccatgcg gggtatcgta tgcttccttc agcactaccctttagctgtt ctatatgctg 12780 ccactcctca attggattag tctcatcctt caatgctatcatttcctttg atattcgatc 12840 ctaggcatag taccgagaaa ctagtgcgaa gtagtgatcaggtattgctg ttatctgatg 12900 agtatacgtt gtcctggcca cggcagaagc acgcttatcgctccaatttc ccacaacatt 12960 agtcaactcc gttaggccct tcattgaaag aaatgaggtcatcaaatgtc ttccaatgtg 13020 agattttggg ccatttttta tagcaaagat tgaataaggcgcatttttct tca 13073 21 12851 DNA Saccharomyces cerevisiae 21 aagcttagctaagcttcgcg gccgcgcaga aatgatgaag ggtgttagcg ccgtccactg 60 atgtgcctggtagtcatgat ttacgtataa ctaacacatc atgaggacgg cggcgtcacc 120 ccaacgcaaaagagtgactt ccctgcgctt tgccaaaacc ccatacatcg ccatctggct 180 cctggcagggcggttgatgg acatcagccg cctcccttaa ttgctaaagc ctccacaagg 240 cacaattaagcaatatttcg ggaaagtaca ccagtcagtt tgcgctttta tgactgggtt 300 ctaaggtactagatgtgaag tagtggtgac agaatcaggg agataagagg gagcagggtg 360 gggtaatgatgtgcgataac aatcttgctt ggctaatcac ccccatatct tgtagtgagt 420 atataaataggagcctccct tcctattgca actccataaa attttttttt gtagccactt 480 ctgtaacaagataaataaaa ccaactaatc gagatatcac atatggaaga cgccaaaaac 540 ataaagaaaggcccggcgcc attctatccg ctggaagatg gaaccgctgg agagcaactg 600 cataaggctatgaagagata cgccctggtt cctggaacaa ttgcttttac agatgcacat 660 atcgaggtggacatcactta cgctgagtac ttcgaaatgt ccgttcggtt ggcagaagct 720 atgaaacgatatgggctgaa tacaaatcac agaatcgtcg tatgcagtga aaactctctt 780 caattctttatgccggtgtt gggcgcgtta tttatcggag ttgcagttgc gcccgcgaac 840 gacatttataatgaacgtga attgctcaac agtatgggca tttcgcagcc taccgtggtg 900 ttcgtttccaaaaaggggtt gcaaaaaatt ttgaacgtgc aaaaaaagct cccaatcatc 960 caaaaaattattatcatgga ttctaaaacg gattaccagg gatttcagtc gatgtacacg 1020 ttcgtcacatctcatctacc tcccggtttt aatgaatacg attttgtgcc agagtccttc 1080 gatagggacaagacaattgc actgatcatg aactcctctg gatctactgg tctgcctaaa 1140 ggtgtcgctctgcctcatag aactgcctgc gtgagattct cgcatgccag agatcctatt 1200 tttggcaatcaaatcattcc ggatactgcg attttaagtg ttgttccatt ccatcacggt 1260 
tttggaatgtttactacact cggatatttg atatgtggat ttcgagtcgt cttaatgtat 1320 agatttgaagaagagctgtt tctgaggagc cttcaggatt acaagattca aagtgcgctg 1380 ctggtgccaaccctattctc cttcttcgcc aaaagcactc tgattgacaa atacgattta 1440 tctaatttacacgaaattgc ttctggtggc gctcccctct ctaaggaagt cggggaagcg 1500 gttgccaagaggttccatct gccaggtatc aggcaaggat atgggctcac tgagactaca 1560 tcagctattctgattacacc cgagggggat gataaaccgg gcgcggtcgg taaagttgtt 1620 ccattttttgaagcgaaggt tgtggatctg gataccggga aaacgctggg cgttaatcaa 1680 agaggcgaactgtgtgtgag aggtcctatg attatgtccg gttatgtaaa caatccggaa 1740 gcgaccaacgccttgattga caaggatgga tggctacatt ctggagacat agcttactgg 1800 gacgaagacgaacacttctt catcgttgac cgcctgaagt ctctgattaa gtacaaaggc 1860 tatcaggtggctcccgctga attggaatcc atcttgctcc aacaccccaa catcttcgac 1920 gcaggtgtcgcaggtcttcc cgacgatgac gccggtgaac ttcccgccgc cgttgttgtt 1980 ttggagcacggaaagacgat gacggaaaaa gagatcgtgg attacgtcgc cagtcaagta 2040 acaaccgcaaaaagttgcgc ggaggagttg tgtttgtgga cgaagtaccg aaaggtctta 2100 ccggaaaactcgacgcaaga aaaatcagag agatcctcat aaaggccaag aagggcggaa 2160 agatcgccgtgtaattggat ccagtttaaa cagtagcttt ggacttcttc gccagaggtt 2220 tggtcaagtctccaatcaag gttgtcggct tgtctacctt gccagaaatt tacgaaaaga 2280 tggaaaagggtcaaatcgtt ggtagatacg ttgttgacac ttctaaataa gcgaatttct 2340 tatgatttatgatttttatt attaaataag ttataaaaaa aataagtgta tacaaatttt 2400 aaagtgactcttaggtttta aaacgaaaat tcttgttctt gagtaactct ttcctgtagg 2460 tcaggttgctttctcaggta tagcatgagg tcgctcttat tgaccacacc tctaccggca 2520 tgccgagcaaatgcctgcaa atcgctcccc atttcaccca attgtagata tgctaactcc 2580 agcaatgagttgatgaatct cggtgtgtat tttatgtcct cagaagacaa cacctgttgt 2640 aatcgttcttccacacggat cgcggccgct tgatcctcta cgccggacgc atcgtggccg 2700 gcatcaccggcgccacaggt gcggttgctg gcgcctatat cgccgacatc accgatgggg 2760 aagatcgggctcgccacttc gggctcatga gcgcttgttt cggcgtgggt atggtggcag 2820 gccccgtggccgggggactg ttgggcgcca tctccttgca tgcaccattc cttgcggcgg 2880 cggtgctcaacggcctcaac ctactactgg gctgcttcct aatgcaggag tcgcataagg 2940 gagagcgtcgaccgatgccc ttgagagcct 
tcaacccagt cagctccttc cggtgggcgc 3000 ggggcatgactatcgtcgcc gcacttatga ctgtcttctt tatcatgcaa ctcgtaggac 3060 aggtgccggcagcgctctgg gtcattttcg gcgaggaccg ctttcgctgg agcgcgacga 3120 tgatcggcctgtcgcttgcg gtattcggaa tcttgcacgc cctcgctcaa gccttcgtca 3180 ctggtcccgccaccaaacgt ttcggcgaga agcaggccat tatcgccggc atggcggccg 3240 acgcgctgggctacgtcttg ctggcgttcg cgacgcgagg ctggatggcc ttccccatta 3300 tgattcttctcgcttccggc ggcatcggga tgcccgcgtt gcaggccatg ctgtccaggc 3360 aggtagatgacgaccatcag ggacagcttc aaggatcgct cgcggctctt accagcctaa 3420 cttcgatcactggaccgctg atcgtcacgg cgatttatgc cgcctcggcg agcacatgga 3480 acgggttggcatggattgta ggcgccgccc tataccttgt ctgcctcccc gcgttgcgtc 3540 gcggtgcatggagccgggcc acctcgacct gaatggaagc cggcggcacc tcgctaacgg 3600 attcaccactccaagaattg gagccaatca attcttgcgg agaactgtga atgcgcaaac 3660 caacccttggcagaacatat ccatcgcgtc cgccatctcc agcagccgca cgcggcgcat 3720 ctcgggcagcgttgggtcct ggccacgggt gcgcatgatc gtgctcctgt cgttgaggac 3780 ccggctaggctggcggggtt gccttactgg ttagcagaat gaatcaccga tacgcgagcg 3840 aacgtgaagcgactgctgct gcaaaacgtc tgcgacctga gcaacaacat gaatggtctt 3900 cggtttccgtgtttcgtaaa gtctggaaac gcggaagtca gcgccctgca ccattatgtt 3960 ccggatctgcatcgcaggat gctgctggct accctgtgga acacctacat ctgtattaac 4020 gaagcgctggcattgaccct gagtgatttt tctctggtcc cgccgcatcc ataccgccag 4080 ttgtttaccctcacaagttc cagtaaccgg gcatgttcat catcagtaac ccgtatcgtg 4140 agcatcctctctcgtttcat cggtatcatt acccccatga acagaaattc ccccttacac 4200 ggaggcatcaagtgaccaaa caggaaaaaa ccgcccttaa catggcccgc tttatcagaa 4260 gccagacattaacgcttctg gagaaactca acgagctgga cgcggatgaa caggcagaca 4320 tctgtgaatcgcttcacgac cacgctgatg agctttaccg cagctgcctc gcgcgtttcg 4380 gtgatgacggtgaaaacctc tgacacatgc agctcccgga gacggtcaca gcttgtctgt 4440 aagcggatgccgggagcaga caagcccgtc agggcgcgtc agcgggtgtt ggcgggtgtc 4500 ggggcgcagccatgacccag tcacgtagcg atagcggagt gtatactggc ttaactatgc 4560 ggcatcagagcagattgtac tgagagtgca cgatatccgg tgtgaaatac cgcacagatg 4620 cgtaaggagaaaataccgca tcaggcgctc ttccgcttcc tcgctcactg actcgctgcg 4680 
ctcggtcgttcggctgcggc gagcggtatc agctcactca aaggcggtaa tacggttatc 4740 cacagaatcaggggataacg caggaaagaa catgtgagca aaaggccagc aaaaggccag 4800 gaaccgtaaaaaggccgcgt tgctggcgtt tttccatagg ctccgccccc ctgacgagca 4860 tcacaaaaatcgacgctcaa gtcagaggtg gcgaaacccg acaggactat aaagatacca 4920 ggcgtttccccctggaagct ccctcgtgcg ctctcctgtt ccgaccctgc cgcttaccgg 4980 atacctgtccgcctttctcc cttcgggaag cgtggcgctt tctcaatgct cacgctgtag 5040 gtatctcagttcggtgtagg tcgttcgctc caagctgggc tgtgtgcacg aaccccccgt 5100 tcagcccgaccgctgcgcct tatccggtaa ctatcgtctt gagtccaacc cggtaagaca 5160 cgacttatcgccactggcag cagccactgg taacaggatt agcagagcga ggtatgtagg 5220 cggtgctacagagttcttga agtggtggcc taactacggc tacactagaa ggacagtatt 5280 tggtatctgcgctctgctga agccagttac cttcggaaaa agagttggta gctcttgatc 5340 cggcaaacaaaccaccgctg gtagcggtgg tttttttgtt tgcaagcagc agattacgcg 5400 cagaaaaaaaggatctcaag aagatccttt gatcttttct acggggtctg acgctcagtg 5460 gaacgaaaactcacgttaag ggattttggt catgagatta tcaaaaagga tcttcaccta 5520 gatccttttaaattaaaaat gaagttttaa atcaatctaa agtatatatg agtaaacttg 5580 gtctgacagttaccaatgct taatcagtga ggcacctatc tcagcgatct gtctatttcg 5640 ttcatccatagttgcctgac tccccgtcgt gtagataact acgatacggg agggcttacc 5700 atctggccccagtgctgcaa tgataccgcg agacccacgc tcaccggctc cagatttatc 5760 agcaataaaccagccagccg gaagggccga gcgcagaagt ggtcctgcaa ctttatccgc 5820 ctccatccagtctattaatt gttgccggga agctagagta agtagttcgc cagttaatag 5880 tttgcgcaacgttgttgcca ttgctgcagg catcgtggtg tcacgctcgt cgtttggtat 5940 ggcttcattcagctccggtt cccaacgatc aaggcgagtt acatgatccc ccatgttgtg 6000 caaaaaagcggttagctcct tcggtcctcc gatcgttgtc agaagtaagt tggccgcagt 6060 gttatcactcatggttatgg cagcactgca taattctctt actgtcatgc catccgtaag 6120 atgcttttctgtgactggtg agtatcaacc aagtcattct gagaatagtg tatgcggcga 6180 ccgagttgctcttgcccggc gtcaacacgg gataataccg cgccacatag cagaacttta 6240 aaagtgctcatcattggaaa acgttcttcg gggcgaaaac tctcaaggat cttaccgctg 6300 ttgagatccagttcgatgta acccactcgt gcacccaact gatcttcagc atcttttact 6360 ttcaccagcgtttctgggtg agcaaaaaca 
ggaaggcaaa atgccgcaaa aaagggaata 6420 agggcgacacggaaatgttg aatactcata ctcttccttt ttcaatatta ttgaagcatt 6480 tatcagggttattgtctcat gagcggatac atatttgaat gtatttagaa aaataaacaa 6540 ataggggttccgcgcacatt tccccgaaaa gtgccacctg acgtctaaga aaccattatt 6600 atcatgacattaacctataa aaataggcgt atcacgaggc cctttcgtct tcaagaattc 6660 cacggactatagactatact agtatactcc gtctactgta cgatacactt ccgctcaggt 6720 ccttgtcctttaacgaggcc ttaccactct tttgttactc tattgatcca gctcagcaaa 6780 ggcagtgtgatctaagattc tatcttcgcg atgtagtaaa actagctaga ccgagaaaga 6840 gactagaaatgcaaaaggca cttctacaat ggctgccatc attattatcc gatgtgacgc 6900 tgcagaagcagaaatacacg cggtcagtga agctattccg ctattgaata acctcagtca 6960 ccttgtgcaagaacttaaca agaaaccaat tattaaaggc ttacttactg atagtagatc 7020 aacgatcagtataattaagt ctacaaatga agagaaattt agaaacagat tttttggcac 7080 aaaggcaatgagacttagag atgaagtatc aggtaataat ttatacgtat actacatcga 7140 gaccaagaagaacattgctg atgtgatgac aaaacctctt ccgataaaaa catttaaact 7200 attaactaacaaatggattc attagatcta ttacattatg ggtggtatgt tggaataaaa 7260 atcaactatcatctactaac tagtatttac gttactagta tattatcata tacggtgtta 7320 gaagatgacgcaaatgatga gaaatagtca tctaaattag tggaagctga aacgcaagga 7380 ttgataatgtaataggatca atgaatatta acatataaaa tgatgataat aatatttata 7440 gaattgtgtagaattgcaga ttccctttta tggattccta aatcctcgag gagaacttct 7500 agtatatctacatacctaat attattgcct tattaaaaat ggaatcccaa caattacatc 7560 aaaatccacattctcttcaa aatcaattgt cctgtacttc cttgttcatg tgtgttcaaa 7620 aacgttatatttataggata attatactct atttctcaac aagtaattgg ttgtttggcc 7680 gagcggtctaaggcgcctga ttcaagaaat atcttgaccg cagttaactg tgggaatact 7740 caggtatcgtaagatgcaag agttcgaatc tcttagcaac cattattttt ttcctcaaca 7800 taacgagaacacacaggggc gctatcgcac agaatcaaat tcgatgactg gaaatttttt 7860 gttaatttcagaggtcgcct gacgcatata cctttttcaa ctgaaaaatt gggagaaaaa 7920 ggaaaggtgagagccgcgga accggctttt catatagaat agagaagcgt tcatgactaa 7980 atgcttgcatcacaatactt gaagttgaca atattattta aggacctatt gttttttcca 8040 ataggtggttagcaatcgtc ttactttcta acttttctta ccttttacat ttcagcaata 8100 
tatatatatatatttcaagg atataccatt ctaatgtctg cccctaagaa gatcgtcgtt 8160 ttgccaggtgaccacgttgg tcaagaaatc acgccgaagc cattaaggtt cttaaagcta 8220 tttctgatgttcgttccaat gtcaagttcg atttcgaaaa tcatttaatt ggtggtgctg 8280 ctatcgatgctacaggtgtc ccacttccag atgaggcgct ggaagcctcc aagaaggttg 8340 atgccgttttgttaggtgct gtgggtggtc ctaaatgggg taccggtagt gttagacctg 8400 aacaaggtttactaaaaatc cgtaaagaac ttcaattgta cgccaactta agaccatgta 8460 actttgcatccgactctctt ttagacttat ctccaatcaa gccacaattt gctaaaggta 8520 ctgacttcgttgttgtcaga gaattagtgg gaggtattta ctttggtaag agaaaggaag 8580 acgatggtgatggtgtcgct tgggatagtg aacaatacac cgttccagaa gtgcaaagaa 8640 tcacaagaatggccgctttc atggccctac aacatgagcc accattgcct atttggtcct 8700 tggataaagctaatgttttg gcctcttcaa gattatggag aaaaactgtg gaggaaacca 8760 tcaagaacgaattccctaca ttgaaggttc aacatcaatt gattgattct gccgccatga 8820 tcctagttaagaacccaacc cacctaaatg gtattataat caccagcaac atgtttggtg 8880 atatcatctccgatgaagcc tccgttatcc caggttcctt gggtttgttg ccatctgcgt 8940 ccttggcctctttgccagac aagaacaccg catttggttt gtacgaacca tgccacggtt 9000 ctgctccagatttgccaaag aataaggtca accctatcgc cactatcttg tctgctgcaa 9060 tgatgttgaaattgtcattg aacttgcctg aagaaggtaa ggccattgaa gatgcagtta 9120 aaaaggttttggatgcaggt atcagaactg gtgatttagg tggttccaac agtaccacgg 9180 aagtcggtgatgctgtcgcc gaagaagtta agaaaatcct tgcttaaaaa gattctcttt 9240 ttttatgatatttgtacata aactttataa atgaaattca taatagaaac gacacgaaat 9300 tacaaaatggaatatgttca tagggtagac gaaactatat acgcaatcta catacattta 9360 tcaagaaggagaaaaaggag gatgtaaagg aatacaggta agcaaattga tactaatggc 9420 tcaacgtgataaggaaaaag aattgcactt taacattaat attgacaagg aggagggcac 9480 cacacaaaaagttaggtgta acagaaaatc atgaaactat gattcctaat ttatatattg 9540 gaggattttctctaaaaaaa aaaaaataca acaaataaaa aacactcaat gacctgacca 9600 tttgatggagtttaagtcaa taccttcttg aaccatttcc cataatggtg aaagttccct 9660 caagaattttactctgtcag aaacggcctt aacgacgtag tcgacctcct cttcagtact 9720 aaatctaccaataccaaatc tgatggaaga atgggctaat gcatcatcct tacccagcgc 9780 atgtaaaacataagaaggtt ctagggaagc 
agatgtacag gctgaacccg aggataatgc 9840 gatatcccttagtgccatca ataaagattc tccttccacg taggcgaaag aaacgttaac 9900 acaccctggataacgatgat ctggagatcc gttcaacgtg gtatgttcag cggataatag 9960 acctttgactaatttatcgg atagtctttt gatgtgagct tggtcgttgt caaattcttt 10020 cttcatcaatctcgcagctt caccaaatcc cgctaccaat gggggggcca aagtaccaga 10080 tctcaatcctctctcttggc caccaccgga tagtaaaggt tctaatctaa ctcttggtct 10140 ccttcttacatagatggcac ctattccctt tggaccgtaa atcttgtgag aagaaattga 10200 tagtaaatcaatgttcattt cattgacatc aatgtgaatc taccataggc ttgtgcggcg 10260 tcagtatgaaagtagatctt attctttcta caaattgcac caatttcttt aataggttga 10320 atgacaccgatttcattatt gacagccatc acagagacga gacaggtatc tggtctaatg 10380 gcatcttccaattccttcaa atcgataaga ccttgatcgt ccacatttag gaaagtgact 10440 tcaaatccctccttcatcat ggcccgtgcg gcttccaaga cacacttgtg ttccgttcta 10500 gtggtgatgatgtgtttctt agtcttctta taaaatcttg ggacaccctt aagaaccata 10560 ttattagattcggtcgctcc cgaagtgaat attatttcct tggggtcggc attgatcatc 10620 tttgctacgtaagctctagc attttccaca gcagtatttg tttcccaacc gtaagagtga 10680 gtgttggaatgaggattacc ataaagtccc gtataaaact tcaacatcgt atccaaaacc 10740 ctagggtctgttggtgtagt ggcttgcatg tcaagatata tgggacgagt accaaaacct 10800 gtgttttcttgataagcatg gctcattgca gtgctaccag aagctactac agcatctggg 10860 gtggtaccggatgcactcgc acgggcacta gcctgtgcct ttgcagcagc ctgaatatcg 10920 gtatgcgtttccagagagaa gttgtcgtct aacttcacgc ctgctgcagt ctcaatgata 10980 ttcgaatacgctttgaggag atacagccta atatccgaca aactgtttta cagatttacg 11040 atcgtacttgttacccatca ttgaattttg aacatccgaa cctgggagtt ttccctgaaa 11100 cagatagtatatttgaacct gtataataat atatagtcta gcgctttacg gaagacaatg 11160 tatgtatttcggttcctgga gaaactattg catctattgc ataggtaatc ttgcacgtcg 11220 catccccggttcattttctg cgtttccatc ttgcacttca atagcatatc tttgttaacg 11280 aagcatctgtgcttcatttt gtagaacaaa aatgcaacgc gagagcgcta atttttcaaa 11340 caaagaatctgagctgcatt tttacagaac agaaatgcaa cgcgaaagcg ctattttacc 11400 aacgaagaatctgtgcttca tttttgtaaa acaaaaatgc aacgcgagag cgctaatttt 11460 tcaaacaaagaatctgagct gcatttttac agaacagaaa 
tgcaacgcga gagcgctatt 11520 ttaccaacaaagaatctata cttctttttt gttctacaaa aatgcatccc gagagcgcta 11580 tttttctaacaaagcatctt agattacttt ttttctcctt tgtgcgctct ataatgcagt 11640 ctcttgataactttttgcac tgtaggtccg ttaaggttag aagaaggcta ctttggtgtc 11700 tattttctcttccataaaaa aagcctgact ccacttcccg cgtttactga ttactagcga 11760 agctgcgggtgcattttttc aagataaagg catccccgat tatattctat accgatgtgg 11820 attgcgcatactttgtgaac agaaagtgat agcgttgatg attcttcatt ggtcagaaaa 11880 ttatgaacggtttcttctat tttgtctcta tatactacgt ataggaaatg tttacatttt 11940 cgtattgttttcgattcact ctatgaatag ttcttactac aatttttttg tctaaagagt 12000 aatactagagataaacataa aaaatgtaga ggtcgagttt agatgcaagt tcaaggagcg 12060 aaaggtggatgggtaggtta tatagggata tagcacagag atatatagca aagagatact 12120 tttgagcaatgtttgtggaa gcggtattcg caatatttta gtagctcgtt acagtccggt 12180 gcgtttttggttttttgaaa gtgcgtcttc agagcgcttt tggttttcaa aagcgctctg 12240 aagttcctatactttctaga gaataggaac ttcggaatag gaacttcaag cgtttccgaa 12300 aacgagcgcttccgaaaatg caacgcgagc tgcgcacata cagctcactg ttcacgtcgc 12360 acctatatctgcgtgttgcc tgtatatata tatacatgag aagaacggca tagtgcgtgt 12420 ttatgcttaaatgcgtactt atatgcgtct atttatgtag gatgaaaggt agtctagtac 12480 ctcctgtgatattatcccat tccatgcggg gtatcgtatg cttccttcag cactaccctt 12540 tagctgttctatatgctgcc actcctcaat tggattagtc tcatccttca atgctatcat 12600 ttcctttgatattcgatcct aggcatagta ccgagaaact agtgcgaagt agtgatcagg 12660 tattgctgttatctgatgag tatacgttgt cctggccacg gcagaagcac gcttatcgct 12720 ccaatttcccacaacattag tcaactccgt taggcccttc attgaaagaa atgaggtcat 12780 caaatgtcttccaatgtgag attttgggcc attttttata gcaaagattg aataaggcgc 12840 atttttcttca 12851 22 12850 DNA Saccharomyces cerevisiae 22 aagcttcgcg gccgcggaggtctgcttcac gagcgcggtg tgcgcctagt attgccccga 60 cggtccgggt gcctatccctagatttcgtc gtgccccgac ccaaatagtt aaacgtgtgg 120 tttatgggtg caccagggctttatcgtgtt ttatatcgat ggcgatttgt gcctccagtg 180 tatttttgta tatccaattaaggtttctta cctaatttta tttttatcat ctttagttaa 240 tgctggtttg ctctgtttctgctgctttct gtgcggttct cctcttctct tgtttcttcg 300 
tgttgtcccc catcgccgatgggcttatat ggcgtatata tatagagcga gtttttacgt 360 cgaagatcat ctcagtttgcttgatagcct ttctacttta ttactttcgt ttttaacctc 420 attatacttt agttttctttgatcggtttt tttctctgta tacttaaaag ttcaaatcaa 480 agaaacatac aaaactacgtttatatcaat tacatatgga agacgccaaa aacataaaga 540 aaggcccggc gccattctatccgctggaag atggaaccgc tggagagcaa ctgcataagg 600 ctatgaagag atacgccctggttcctggaa caattgcttt tacagatgca catatcgagg 660 tggacatcac ttacgctgagtacttcgaaa tgtccgttcg gttggcagaa gctatgaaac 720 gatatgggct gaatacaaatcacagaatcg tcgtatgcag tgaaaactct cttcaattct 780 ttatgccggt gttgggcgcgttatttatcg gagttgcagt tgcgcccgcg aacgacattt 840 ataatgaacg tgaattgctcaacagtatgg gcatttcgca gcctaccgtg gtgttcgttt 900 ccaaaaaggg gttgcaaaaaattttgaacg tgcaaaaaaa gctcccaatc atccaaaaaa 960 ttattatcat ggattctaaaacggattacc agggatttca gtcgatgtac acgttcgtca 1020 catctcatct acctcccggttttaatgaat acgattttgt gccagagtcc ttcgataggg 1080 acaagacaat tgcactgatcatgaactcct ctggatctac tggtctgcct aaaggtgtcg 1140 ctctgcctca tagaactgcctgcgtgagat tctcgcatgc cagagatcct atttttggca 1200 atcaaatcat tccggatactgcgattttaa gtgttgttcc attccatcac ggttttggaa 1260 tgtttactac actcggatatttgatatgtg gatttcgagt cgtcttaatg tatagatttg 1320 aagaagagct gtttctgaggagccttcagg attacaagat tcaaagtgcg ctgctggtgc 1380 caaccctatt ctccttcttcgccaaaagca ctctgattga caaatacgat ttatctaatt 1440 tacacgaaat tgcttctggtggcgctcccc tctctaagga agtcggggaa gcggttgcca 1500 agaggttcca tctgccaggtatcaggcaag gatatgggct cactgagact acatcagcta 1560 ttctgattac acccgagggggatgataaac cgggcgcggt cggtaaagtt gttccatttt 1620 ttgaagcgaa ggttgtggatctggataccg ggaaaacgct gggcgttaat caaagaggcg 1680 aactgtgtgt gagaggtcctatgattatgt ccggttatgt aaacaatccg gaagcgacca 1740 acgccttgat tgacaaggatggatggctac attctggaga catagcttac tgggacgaag 1800 acgaacactt cttcatcgttgaccgcctga agtctctgat taagtacaaa ggctatcagg 1860 tggctcccgc tgaattggaatccatcttgc tccaacaccc caacatcttc gacgcaggtg 1920 tcgcaggtct tcccgacgatgacgccggtg aacttcccgc cgccgttgtt gttttggagc 1980 acggaaagac gatgacggaaaaagagatcg tggattacgt 
cgccagtcaa gtaacaaccg 2040 cgaaaaagtt gcgcggaggagttgtgtttg tggacgaagt accgaaaggt cttaccggaa 2100 aactcgacgc aagaaaaatcagagagatcc tcataaaggc caagaagggc ggaaagatcg 2160 ccgtgtaatt ggatccagtttaaacagtag ctttggactt cttcgccaga ggtttggtca 2220 agtctccaat caaggttgtcggcttgtcta ccttgccaga aatttacgaa aagatggaaa 2280 agggtcaaat cgttggtagatacgttgttg acacttctaa ataagcgaat ttcttatgat 2340 ttatgatttt tattattaaataagttataa aaaaaataag tgtatacaaa ttttaaagtg 2400 actcttaggt tttaaaacgaaaattcttgt tcttgagtaa ctctttcctg taggtcaggt 2460 tgctttctca ggtatagcatgaggtcgctc ttattgacca cacctctacc ggcatgccga 2520 gcaaatgcct gcaaatcgctccccatttca cccaattgta gatatgctaa ctccagcaat 2580 gagttgatga atctcggtgtgtattttatg tcctcagaag acaacacctg ttgtaatcgt 2640 tcttccacac ggatcgcggccgcttgatcc tctacgccgg acgcatcgtg gccggcatca 2700 ccggcgccac aggtgcggttgctggcgcct atatcgccga catcaccgat ggggaagatc 2760 gggctcgcca cttcgggctcatgagcgctt gtttcggcgt gggtatggtg gcaggccccg 2820 tggccggggg actgttgggcgccatctcct tgcatgcacc attccttgcg gcggcggtgc 2880 tcaacggcct caacctactactgggctgct tcctaatgca ggagtcgcat aagggagagc 2940 gtcgaccgat gcccttgagagccttcaacc cagtcagctc cttccggtgg gcgcggggca 3000 tgactatcgt cgccgcacttatgactgtct tctttatcat gcaactcgta ggacaggtgc 3060 cggcagcgct ctgggtcattttcggcgagg accgctttcg ctggagcgcg acgatgatcg 3120 gcctgtcgct tgcggtattcggaatcttgc acgccctcgc tcaagccttc gtcactggtc 3180 ccgccaccaa acgtttcggcgagaagcagg ccattatcgc cggcatggcg gccgacgcgc 3240 tgggctacgt cttgctggcgttcgcgacgc gaggctggat ggccttcccc attatgattc 3300 ttctcgcttc cggcggcatcgggatgcccg cgttgcaggc catgctgtcc aggcaggtag 3360 atgacgacca tcagggacagcttcaaggat cgctcgcggc tcttaccagc ctaacttcga 3420 tcactggacc gctgatcgtcacggcgattt atgccgcctc ggcgagcaca tggaacgggt 3480 tggcatggat tgtaggcgccgccctatacc ttgtctgcct ccccgcgttg cgtcgcggtg 3540 catggagccg ggccacctcgacctgaatgg aagccggcgg cacctcgcta acggattcac 3600 cactccaaga attggagccaatcaattctt gcggagaact gtgaatgcgc aaaccaaccc 3660 ttggcagaac atatccatcgcgtccgccat ctccagcagc cgcacgcggc gcatctcggg 3720 cagcgttggg 
tcctggccacgggtgcgcat gatcgtgctc ctgtcgttga ggacccggct 3780 aggctggcgg ggttgccttactggttagca gaatgaatca ccgatacgcg agcgaacgtg 3840 aagcgactgc tgctgcaaaacgtctgcgac ctgagcaaca acatgaatgg tcttcggttt 3900 ccgtgtttcg taaagtctggaaacgcggaa gtcagcgccc tgcaccatta tgttccggat 3960 ctgcatcgca ggatgctgctggctaccctg tggaacacct acatctgtat taacgaagcg 4020 ctggcattga ccctgagtgatttttctctg gtcccgccgc atccataccg ccagttgttt 4080 accctcacaa cgttccagtaaccgggcatg ttcatcatca gtaacccgta tcgtgagcat 4140 cctctctcgt ttcatcggtatcattacccc catgaacaga aattccccct tacacggagg 4200 catcaagtga ccaaacaggaaaaaaccgcc cttaacatgg cccgctttat cagaagccag 4260 acattaacgc ttctggagaaactcaacgag ctggacgcgg atgaacaggc agacatctgt 4320 gaatcgcttc acgaccacgctgatgagctt taccgcagct gcctcgcgcg tttcggtgat 4380 gacggtgaaa acctctgacacatgcagctc ccggagacgg tcacagcttg tctgtaagcg 4440 gatgccggga gcagacaagcccgtcagggc gcgtcagcgg gtgttggcgg gtgtcggggc 4500 gcagccatga cccagtcacgtagcgatagc ggagtgtata ctggcttaac tatgcggcat 4560 cagagcagat tgtactgagagtgcacgata tccggtgtga aataccgcac agatgcgtaa 4620 ggagaaaata ccgcatcaggcgctcttccg cttcctcgct cactgactcg ctgcgctcgg 4680 tcgttcggct gcggcgagcggtatcagctc actcaaaggc ggtaatacgg ttatccacag 4740 aatcagggga taacgcaggaaagaacatgt gagcaaaagg ccagcaaaag gccaggaacc 4800 gtaaaaaggc cgcgttgctggcgtttttcc ataggctccg cccccctgac gagcatcaca 4860 aaaatcgacg ctcaagtcagaggtggcgaa acccgacagg actataaaga taccaggcgt 4920 ttccccctgg aagctccctcgtgcgctctc ctgttccgac cctgccgctt accggatacc 4980 tgtccgcctt tctcccttcgggaagcgtgg cgctttctca atgctcacgc tgtaggtatc 5040 tcagttcggt gtaggtcgttcgctccaagc tgggctgtgt gcacgaaccc cccgttcagc 5100 ccgaccgctg cgccttatccggtaactatc gtcttgagtc caacccggta agacacgact 5160 tatcgccact ggcagcagccactggtaaca ggattagcag agcgaggtat gtaggcggtg 5220 ctacagagtt cttgaagtggtggcctaact acggctacac tagaaggaca gtatttggta 5280 tctgcgctct gctgaagccagttaccttcg gaaaaagagt tggtagctct tgatccggca 5340 aacaaaccac cgctggtagcggtggttttt ttgtttgcaa gcagcagatt acgcgcagaa 5400 aaaaaggatc tcaagaagatcctttgatct tttctacggg 
gtctgacgct cagtggaacg 5460 aaaactcacg ttaagggattttggtcatga gattatcaaa aaggatcttc acctagatcc 5520 ttttaaatta aaaatgaagttttaaatcaa tctaaagtat atatgagtaa acttggtctg 5580 acagttacca atgcttaatcagtgaggcac ctatctcagc gatctgtcta tttcgttcat 5640 ccatagttgc ctgactccccgtcgtgtaga taactacgat acgggagggc ttaccatctg 5700 gccccagtgc tgcaatgataccgcgagacc cacgctcacc ggctccagat ttatcagcaa 5760 taaaccagcc agccggaagggccgagcgca gaagtggtcc tgcaacttta tccgcctcca 5820 tccagtctat taattgttgccgggaagcta gagtaagtag ttcgccagtt aatagtttgc 5880 gcaacgttgt tgccattgctgcaggcatcg tggtgtcacg ctcgtcgttt ggtatggctt 5940 cattcagctc cggttcccaacgatcaaggc gagttacatg atcccccatg ttgtgcaaaa 6000 aagcggttag ctccttcggtcctccgatcg ttgtcagaag taagttggcc gcagtgttat 6060 cactcatggt tatggcagcactgcataatt ctcttactgt catgccatcc gtaagatgct 6120 tttctgtgac tggtgagtactcaaccaagt cattctgaga atagtgtatg cggcgaccga 6180 gttgctcttg cccggcgtcaacacgggata ataccgcgcc acatagcaga actttaaaag 6240 tgctcatcat tggaaaacgttcttcggggc gaaaactctc aaggatctta ccgctgttga 6300 gatccagttc gatgtaacccactcgtgcac ccaactgatc ttcagcatct tttactttca 6360 ccagcgtttc tgggtgagcaaaaacaggaa ggcaaaatgc cgcaaaaaag ggaataaggg 6420 cgacacggaa atgttgaatactcatactct tcctttttca atattattga agcatttatc 6480 agggttattg tctcatgagcggatacatat ttgaatgtat ttagaaaaat aaacaaatag 6540 gggttccgcg cacatttccccgaaaagtgc cacctgacgt ctaagaaacc attattatca 6600 tgacattaac ctataaaaataggcgtatca cgaggccctt tcgtcttcaa gaattccacg 6660 gactatagac tatactagtatactccgtct actgtacgat acacttccgc tcaggtcctt 6720 gtcctttaac gaggccttaccactcttttg ttactctatt gatccagctc agcaaaggca 6780 gtgtgatcta agattctatcttcgcgatgt agtaaaacta gctagaccga gaaagagact 6840 agaaatgcaa aaggcacttctacaatggct gccatcatta ttatccgatg tgacgctgca 6900 gaagcagaaa tacacgcggtcagtgaagct attccgctat tgaataacct cagtcacctt 6960 gtgcaagaac ttaacaagaaaccaattatt aaaggcttac ttactgatag tagatcaacg 7020 atcagtataa ttaagtctacaaatgaagag aaatttagaa acagattttt tggcacaaag 7080 gcaatgagac ttagagatgaagtatcaggt aataatttat acgtatacta catcgagacc 7140 aagaagaaca 
ttgctgatgtgatgacaaaa cctcttccga taaaaacatt taaactatta 7200 actaacaaat ggattcattagatctattac attatgggtg gtatgttgga ataaaaatca 7260 actatcatct actaactagtatttacgtta ctagtatatt atcatatacg gtgttagaag 7320 atgacgcaaa tgatgagaaatagtcatcta aattagtgga agctgaaacg caaggattga 7380 taatgtaata ggatcaatgaatattaacat ataaaatgat gataataata tttatagaat 7440 tgtgtagaat tgcagattcccttttatgga ttcctaaatc ctcgaggaga acttctagta 7500 tatctacata cctaatattattgccttatt aaaaatggaa tcccaacaat tacatcaaaa 7560 tccacattct cttcaaaatcaattgtcctg tacttccttg ttcatgtgtg ttcaaaaacg 7620 ttatatttat aggataattatactctattt ctcaacaagt aattggttgt ttggccgagc 7680 ggtctaaggc gcctgattcaagaaatatct tgaccgcagt taactgtggg aatactcagg 7740 tatcgtaaga tgcaagagttcgaatctctt agcaaccatt atttttttcc tcaacataac 7800 gagaacacac aggggcgctatcgcacagaa tcaaattcga tgactggaaa ttttttgtta 7860 atttcagagg tcgcctgacgcatatacctt tttcaactga aaaattggga gaaaaaggaa 7920 aggtgagagc cgcggaaccggcttttcata tagaatagag aagcgttcat gactaaatgc 7980 ttgcatcaca atacttgaagttgacaatat tatttaagga cctattgttt tttccaatag 8040 gtggttagca atcgtcttactttctaactt ttcttacctt ttacatttca gcaatatata 8100 tatatatatt tcaaggatataccattctaa tgtctgcccc taagaagatc gtcgttttgc 8160 caggtgacca cgttggtcaagaaatcacag ccgaagccat taaggttctt aaagctattt 8220 ctgatgttcg ttccaatgtcaagttcgatt tcgaaaatca tttaattggt ggtgctgcta 8280 tcgatgctac aggtgtcccacttccagatg aggcgctgga agcctccaag aaggttgatg 8340 ccgttttgtt aggtgctgtgggtggtccta aatggggtac cggtagtgtt agacctgaac 8400 aaggtttact aaaaatccgtaaagaacttc aattgtacgc caacttaaga ccatgtaact 8460 ttgcatccga ctctcttttagacttatctc caatcaagcc acaatttgct aaaggtactg 8520 acttcgttgt tgtcagagaattagtgggag gtatttactt tggtaagaga aaggaagacg 8580 atggtgatgg tgtcgcttgggatagtgaac aatacaccgt tccagaagtg caaagaatca 8640 caagaatggc cgctttcatggccctacaac atgagccacc attgcctatt tggtccttgg 8700 ataaagctaa tgttttggcctcttcaagat tatggagaaa aactgtggag gaaaccatca 8760 agaacgaatt ccctacattgaaggttcaac atcaattgat tgattctgcc gccatgatcc 8820 tagttaagaa cccaacccacctaaatggta ttataatcac 
cagcaacatg tttggtgata 8880 tcatctccga tgaagcctccgttatcccag gttccttggg tttgttgcca tctgcgtcct 8940 tggcctcttt gccagacaagaacaccgcat ttggtttgta cgaaccatgc cacggttctg 9000 ctccagattt gccaaagaataaggtcaacc ctatcgccac tatcttgtct gctgcaatga 9060 tgttgaaatt gtcattgaacttgcctgaag aaggtaaggc cattgaagat gcagttaaaa 9120 aggttttgga tgcaggtatcagaactggtg atttaggtgg ttccaacagt accacggaag 9180 tcggtgatgc tgtcgccgaagaagttaaga aaatccttgc ttaaaaagat tctctttttt 9240 tatgatattt gtacataaactttataaatg aaattcataa tagaaacgac acgaaattac 9300 aaaatggaat atgttcatagggtagacgaa actatatacg caatctacat acatttatca 9360 agaaggagaa aaaggaggatgtaaaggaat acaggtaagc aaattgatac taatggctca 9420 acgtgataag gaaaaagaattgcactttaa cattaatatt gacaaggagg agggcaccac 9480 acaaaaagtt aggtgtaacagaaaatcatg aaactatgat tcctaattta tatattggag 9540 gattttctct aaaaaaaaaaaaatacaaca aataaaaaac actcaatgac ctgaccattt 9600 gatggagttt aagtcaataccttcttgaac catttcccat aatggtgaaa gttccctcaa 9660 gaattttact ctgtcagaaacggccttaac gacgtagtcg acctcctctt cagtactaaa 9720 tctaccaata ccaaatctgatggaagaatg ggctaatgca tcatccttac ccagcgcatg 9780 taaaacataa gaaggttctagggaagcaga tgtacaggct gaacccgagg ataatgcgat 9840 atcccttagt gccatcaataaagattctcc ttccacgtag gcgaaagaaa cgttaacaca 9900 ccctggataa cgatgatctggagatccgtt caacgtggta tgttcagcgg ataatagacc 9960 tttgactaat ttatcggatagtcttttgat gtgagcttgg tcgttgtcaa attctttctt 10020 catcaatctc gcagcttcaccaaatcccgc taccaatggg ggggccaaag taccagatct 10080 caatcctctc tcttggccaccaccggatag taaaggttct aatctaactc ttggtctcct 10140 tcttacatag atggcacctattccctttgg accgtaaatc ttgtgagaag aaattgatag 10200 taaatcaatg ttcatttcattgacatcaat gtgaatctta ccataggctt gtgcggcgtc 10260 agtatgaaag tagatcttattctttctaca aattgcacca atttctttaa taggttgaat 10320 gacaccgatt tcattattgacagccatcac agagacgaga caggtatctg gtctaatggc 10380 atcttccaat tccttcaaatcgataagacc ttgatcgtcc acatttagga aagtgacttc 10440 aaatccctcc ttcatcatggcccgtgcggc ttccaagaca cacttgtgtt ccgttctagt 10500 ggtgatgatg tgtttcttagtcttcttata aaatcttggg acacccttaa gaaccatatt 10560 
attagattcg gtcgctcccgaagtgaatat tatttccttg gggtcggcat tgatcatctt 10620 tgctacgtaa gctctagcattttccacagc agtatttgtt tcccaaccgt aagagtgagt 10680 gttggaatga ggattaccataaagtcccgt ataaaacttc aacatcgtat ccaaaaccct 10740 agggtctgtt ggtgtagtggcttgcatgtc aagatatatg ggacgagtac caaaacctgt 10800 gttttcttga taagcatggctcattgcagt gctaccagaa gctactacag catctggggt 10860 ggtaccggat gcactcgcacgggcactagc ctgtgccttt gcagcagcct gaatatcggt 10920 atgcgtttcc agagagaagttgtcgtctaa cttcacgcct gctgcagtct caatgatatt 10980 cgaatacgct ttgaggagatacagcctaat atccgacaaa ctgttttaca gatttacgat 11040 cgtacttgtt acccatcattgaattttgaa catccgaacc tgggagtttt ccctgaaaca 11100 gatagtatat ttgaacctgtataataatat atagtctagc gctttacgga agacaatgta 11160 tgtatttcgg ttcctggagaaactattgca tctattgcat aggtaatctt gcacgtcgca 11220 tccccggttc attttctgcgtttccatctt gcacttcaat agcatatctt tgttaacgaa 11280 gcatctgtgc ttcattttgtagaacaaaaa tgcaacgcga gagcgctaat ttttcaaaca 11340 aagaatctga gctgcatttttacagaacag aaatgcaacg cgaaagcgct attttaccaa 11400 cgaagaatct gtgcttcatttttgtaaaac aaaaatgcaa cgcgagagcg ctaatttttc 11460 aaacaaagaa tctgagctgcatttttacag aacagaaatg caacgcgaga gcgctatttt 11520 accaacaaag aatctatacttcttttttgt tctacaaaaa tgcatcccga gagcgctatt 11580 tttctaacaa agcatcttagattacttttt ttctcctttg tgcgctctat aatgcagtct 11640 cttgataact ttttgcactgtaggtccgtt aaggttagaa gaaggctact ttggtgtcta 11700 ttttctcttc cataaaaaaagcctgactcc acttcccgcg tttactgatt actagcgaag 11760 ctgcgggtgc attttttcaagataaaggca tccccgatta tattctatac cgatgtggat 11820 tgcgcatact ttgtgaacagaaagtgatag cgttgatgat tcttcattgg tcagaaaatt 11880 atgaacggtt tcttctattttgtctctata tactacgtat aggaaatgtt tacattttcg 11940 tattgttttc gattcactctatgaatagtt cttactacaa tttttttgtc taaagagtaa 12000 tactagagat aaacataaaaaatgtagagg tcgagtttag atgcaagttc aaggagcgaa 12060 aggtggatgg gtaggttatatagggatata gcacagagat atatagcaaa gagatacttt 12120 tgagcaatgt ttgtggaagcggtattcgca atattttagt agctcgttac agtccggtgc 12180 gtttttggtt ttttgaaagtgcgtcttcag agcgcttttg gttttcaaaa gcgctctgaa 12240 gttcctatac 
tttctagagaataggaactt cggaatagga acttcaaagc gtttccgaaa 12300 acgagcgctt ccgaaaatgcaacgcgagct gcgcacatac agctcactgt tcacgtcgca 12360 cctatatctg cgtgttgcctgtatatatat atacatgaga agaacggcat agtgcgtgtt 12420 tatgcttaaa tgcgtacttatatgcgtcta tttatgtagg atgaaaggta gtctagtacc 12480 tcctgtgata ttatcccattccatgcgggg tatcgtatgc ttccttcagc actacccttt 12540 agctgttcta tatgctgccactcctcaatt ggattagtct catccttcaa tgctatcatt 12600 tcctttgata ttcgatcctaggcatagtac cgagaaacta gtgcgaagta gtgatcaggt 12660 attgctgtta tctgatgagtatacgttgtc ctggccacgg cagaagcacg cttatcgctc 12720 caatttccca caacattagtcaactccgtt aggcccttca ttgaaagaaa tgaggtcatc 12780 aaatgtcttc caatgtgagattttgggcca ttttttatag caaagattga ataaggcgca 12840 tttttcttca 12850 23 11198 DNA Saccharomyces cerevisiae 23 agcttcgcgg ccgccgtctg atttccgttttgggaatcct ttgccgcgcg cccctctcaa 60 aactccgcac aagtcccaga aagcgggaaagaaataaaac gccaccaaaa aaaaaaaaat 120 aaaagccaat cctcgaagcg tgggtggtaggccctggatt atcccgtaca agtatttctc 180 aggagtaaaa aaaccgtttg ttttggaattccccatttcg cggccaccta cgccgctatc 240 tttgcaacaa ctatctgcga taactcagcaaattttgcat attcgtgttg cagtattgcg 300 ataatgggag tcttactccc aacataacggcagaaagaaa tgtgagaaaa ttttgcatcc 360 tttgcctccg ttcaagtata taaagtcggcatgcttgata atctttcttt ccatcctaca 420 ttgttctaat tattcttatt ctcctttattctttcctaac ataccaagaa attaatcttc 480 tgtcattcgc ttaaacacta tatcacatatgcggtccgga tccagtttaa acagtagctt 540 tggacttctt cgccagaggt ttggtcaagtctccaatcaa ggttgtcggc ttgtctacct 600 tgccagaaat ttacgaaaag atggaaaagggtcaaatcgt tggtagatac gttgttgaca 660 cttctaaata agcgaatttc ttatgatttatgatttttat tattaaataa gttataaaaa 720 aaataagtgt atacaaattt taaagtgactcttaggtttt aaaacgaaaa ttcttgttct 780 tgagtaactc tttcctgtag gtcaggttgctttctcaggt atagcatgag gtcgctctta 840 ttgaccacac ctctaccggc atgccgagcaaatgcctgca aatcgctccc catttcaccc 900 aattgtagat atgctaactc cagcaatgagttgatgaatc tcggtgtgta ttttatgtcc 960 tcagaagaca acacctgttg taatcgttcttccacacgga tcgcggccgc ttgatcctct 1020 acgccggacg catcgtggcc ggcatcaccggcgccacagg tgcggttgct ggcgcctata 1080 
tcgccgacat caccgatggg gaagatcgggctcgccactt cgggctcatg agcgcttgtt 1140 tcggcgtggg tatggtggca ggccccgtggccgggggact gttgggcgcc atctccttgc 1200 atgcaccatt ccttgcggcg gcggtgctcaacggcctcaa cctactactg ggctgcttcc 1260 taatgcagga gtcgcataag ggagagcgtcgaccgatgcc cttgagagcc ttcaacccag 1320 tcagctcctt ccggtgggcg cggggcatgactatcgtcgc cgcacttatg actgtcttct 1380 ttatcatgca actcgtagga caggtgccggcagcgctctg ggtcattttc ggcgaggacc 1440 gctttcgctg gagcgcgacg atgatcggcctgtcgcttgc ggtattcgga atcttgcacg 1500 ccctcgctca agccttcgtc actggtcccgccaccaaacg tttcggcgag aagcaggcca 1560 ttatcgccgg catggcggcc gacgcgctgggctacgtctt gctggcgttc gcgacgcgag 1620 gctggatggc cttccccatt atgattcttctcgcttccgg cggcatcggg atgcccgcgt 1680 tgcaggccat gctgtccagg caggtagatgacgaccatca gggacagctt caaggatcgc 1740 tcgcggctct taccagccta acttcgatcactggaccgct gatcgtcacg gcgatttatg 1800 ccgcctcggc gagcacatgg aacgggttggcatggattgt aggcgccgcc ctataccttg 1860 tctgcctccc cgcgttgcgt cgcggtgcatggagccgggc cacctcgacc tgaatggaag 1920 ccggcggcac ctcgctaacg gattcaccactccaagaatt ggagccaatc aattcttgcg 1980 gagaactgtg aatgcgcaaa ccaacccttggcagaacata tccatcgcgt ccgccatctc 2040 cagcagccgc acgcggcgca tctcgggcagcgttgggtcc tggccacggg tgcgcatgat 2100 cgtgctcctg tcgttgagga cccggctaggctggcggggt tgccttactg gttagcagaa 2160 tgaatcaccg atacgcgagc gaacgtgaagcgactgctgc tgcaaaacgt ctgcgacctg 2220 agcaacaaca tgaatggtct tcggtttccgtgtttcgtaa agtctggaaa cgcggaagtc 2280 agcgccctgc accattatgt tccggatctgcatcgcagga tgctgctggc taccctgtgg 2340 aacacctaca tctgtattaa cgaagcgctggcattgaccc tgagtgattt ttctctggtc 2400 ccgccgcatc cataccgcca gttgtttaccctcacaacgt tccagtaacc gggcatgttc 2460 atcatcagta acccgtatcg tgagcatcctctctcgtttc atcggtatca ttacccccat 2520 gaacagaaat tcccccttac acggaggcatcaagtgacca aacaggaaaa aaccgccctt 2580 aacatggccc gctttatcag aagccagacattaacgcttc tggagaaact caacgagctg 2640 gacgcggatg aacaggcaga catctgtgaatcgcttcacg accacgctga tgagctttac 2700 cgcagctgcc tcgcgcgttt cggtgatgacggtgaaaacc tctgacacat gcagctcccg 2760 gagacggtca cagcttgtct 
gtaagcggatgccgggagca gacaagcccg tcagggcgcg 2820 tcagcgggtg ttggcgggtg tcggggcgcagccatgaccc agtcacgtag cgatagcgga 2880 gtgtatactg gcttaactat gcggcatcagagcagattgt actgagagtg cacgatatcc 2940 ggtgtgaaat accgcacaga tgcgtaaggagaaaataccg catcaggcgc tcttccgctt 3000 cctcgctcac tgactcgctg cgctcggtcgttcggctgcg gcgagcggta tcagctcact 3060 caaaggcggt aatacggtta tccacagaatcaggggataa cgcaggaaag aacatgtgag 3120 caaaaggcca gcaaaaggcc aggaaccgtaaaaaggccgc gttgctggcg tttttccata 3180 ggctccgccc ccctgacgag catcacaaaaatcgacgctc aagtcagagg tggcgaaacc 3240 cgacaggact ataaagatac caggcgtttccccctggaag ctccctcgtg cgctctcctg 3300 ttccgaccct gccgcttacc ggatacctgtccgcctttct cccttcggga agcgtggcgc 3360 tttctcaatg ctcacgctgt aggtatctcagttcggtgta ggtcgttcgc tccaagctgg 3420 gctgtgtgca cgaacccccc gttcagcccgaccgctgcgc cttatccggt aactatcgtc 3480 ttgagtccaa cccggtaaga cacgacttatcgccactggc agcagccact ggtaacagga 3540 ttagcagagc gaggtatgta ggcggtgctacagagttctt gaagtggtgg cctaactacg 3600 gctacactag aaggacagta tttggtatctgcgctctgct gaagccagtt accttcggaa 3660 aaagagttgg tagctcttga tccggcaaacaaaccaccgc tggtagcggt ggtttttttg 3720 tttgcaagca gcagattacg cgcagaaaaaaaggatctca agaagatcct ttgatctttt 3780 ctacggggtc tgacgctcag tggaacgaaaactcacgtta agggattttg gtcatgagat 3840 tatcaaaaag gatcttcacc tagatccttttaaattaaaa atgaagtttt aaatcaatct 3900 aaagtatata tgagtaaact tggtctgacagttaccaatg cttaatcagt gaggcaccta 3960 tctcagcgat ctgtctattt cgttcatccatagttgcctg actccccgtc gtgtagataa 4020 ctacgatacg ggagggctta ccatctggccccagtgctgc aatgataccg cgagacccac 4080 gctcaccggc tccagattta tcagcaataaaccagccagc cggaagggcc gagcgcagaa 4140 gtggtcctgc aactttatcc gcctccatccagtctattaa ttgttgccgg gaagctagag 4200 taagtagttc gccagttaat agtttgcgcaacgttgttgc cattgctgca ggcatcgtgg 4260 tgtcacgctc gtcgtttggt atggcttcattcagctccgg ttcccaacga tcaaggcgag 4320 ttacatgatc ccccatgttg tgcaaaaaagcggttagctc cttcggtcct ccgatcgttg 4380 tcagaagtaa gttggccgca gtgttatcactcatggttat ggcagcactg cataattctc 4440 ttactgtcat gccatccgta agatgcttttctgtgactgg tgagtactca 
accaagtcat 4500 tctgagaata gtgtatgcgg cgaccgagttgctcttgccc ggcgtcaaca cgggataata 4560 ccgcgccaca tagcagaact ttaaaagtgctcatcattgg aaaacgttct tcggggcgaa 4620 aactctcaag gatcttaccg ctgttgagatccagttcgat gtaacccact cgtgcaccca 4680 actgatcttc agcatctttt actttcaccagcgtttctgg gtgagcaaaa acaggaaggc 4740 aaaatgccgc aaaaaaggga ataagggcgacacggaaatg ttgaatactc atactcttcc 4800 tttttcaata ttattgaagc atttatcagggttattgtct catgagcgga tacatatttg 4860 aatgtattta gaaaaataaa caaataggggttccgcgcac atttccccga aaagtgccac 4920 ctgacgtcta agaaaccatt attatcatgacattaaccta taaaaatagg cgtatcacga 4980 ggccctttcg tcttcaagaa ttccacggactatagactat actagtatac tccgtctact 5040 gtacgataca cttccgctca ggtccttgtcctttaacgag gccttaccac tcttttgtta 5100 ctctattgat ccagctcagc aaaggcagtgtgatctaaga ttctatcttc gcgatgtagt 5160 aaaactagct agaccgagaa agagactagaaatgcaaaag gcacttctac aatggctgcc 5220 atcattatta tccgatgtga cgctgcagaagcagaaatac acgcggtcag tgaagctatt 5280 ccgctattga ataacctcag tcaccttgtgcaagaactta acaagaaacc aattattaaa 5340 ggcttactta ctgatagtag atcaacgatcagtataatta agtctacaaa tgaagagaaa 5400 tttagaaaca gattttttgg cacaaaggcaatgagactta gagatgaagt atcaggtaat 5460 aatttatacg tatactacat cgagaccaagaagaacattg ctgatgtgat gacaaaacct 5520 cttccgataa aaacatttaa actattaactaacaaatgga ttcattagat ctattacatt 5580 atgggtggta tgttggaata aaaatcaactatcatctact aactagtatt tacgttacta 5640 gtatattatc atatacggtg ttagaagatgacgcaaatga tgagaaatag tcatctaaat 5700 tagtggaagc tgaaacgcaa ggattgataatgtaatagga tcaatgaata ttaacatata 5760 aaatgatgat aataatattt atagaattgtgtagaattgc agattccctt ttatggattc 5820 ctaaatcctc gaggagaact tctagtatatctacatacct aatattattg ccttattaaa 5880 aatggaatcc caacaattac atcaaaatccacattctctt caaaatcaat tgtcctgtac 5940 ttccttgttc atgtgtgttc aaaaacgttatatttatagg ataattatac tctatttctc 6000 aacaagtaat tggttgtttg gccgagcggtctaaggcgcc tgattcaaga aatatcttga 6060 ccgcagttaa ctgtgggaat actcaggtatcgtaagatgc aagagttcga atctcttagc 6120 aaccattatt tttttcctca acataacgagaacacacagg ggcgctatcg cacagaatca 6180 aattcgatga ctggaaattt 
tttgttaatttcagaggtcg cctgacgcat ataccttttt 6240 caactgaaaa attgggagaa aaaggaaaggtgagagccgc ggaaccggct tttcatatag 6300 aatagagaag cgttcatgac taaatgcttgcatcacaata cttgaagttg acaatattat 6360 ttaaggacct attgtttttt ccaataggtggttagcaatc gtcttacttt ctaacttttc 6420 ttacctttta catttcagca atatatatatatatatttca aggatatacc attctaatgt 6480 ctgcccctaa gaagatcgtc gttttgccaggtgaccacgt tggtcaagaa atcacagccg 6540 aagccattaa ggttcttaaa gctatttctgatgttcgttc caatgtcaag ttcgatttcg 6600 aaaatcattt aattggtggt gctgctatcgatgctacagg tgtcccactt ccagatgagg 6660 cgctggaagc ctccaagaag gttgatgccgttttgttagg tgctgtgggt ggtcctaaat 6720 ggggtaccgg tagtgttaga cctgaacaaggtttactaaa aatccgtaaa gaacttcaat 6780 tgtacgccaa cttaagacca tgtaactttgcatccgactc tcttttagac ttatctccaa 6840 tcaagccaca atttgctaaa ggtactgacttcgttgttgt cagagaatta gtgggaggta 6900 tttactttgg taagagaaag gaagacgatggtgatggtgt cgcttgggat agtgaacaat 6960 acaccgttcc agaagtgcaa agaatcacaagaatggccgc tttcatggcc ctacaacatg 7020 agccaccatt gcctatttgg tccttggataaagctaatgt tttggcctct tcaagattat 7080 ggagaaaaac tgtggaggaa accatcaagaacgaattccc tacattgaag gttcaacatc 7140 aattgattga ttctgccgcc atgatcctagttaagaaccc aacccaccta aatggtatta 7200 taatcaccag caacatgttt ggtgatatcatctccgatga agcctccgtt atcccaggtt 7260 ccttgggttt gttgccatct gcgtccttggcctctttgcc agacaagaac accgcatttg 7320 gtttgtacga accatgccac ggttctgctccagatttgcc aaagaataag gtcaacccta 7380 tcgccactat cttgtctgct gcaatgatgttgaaattgtc attgaacttg cctgaagaag 7440 gtaaggccat tgaagatgca gttaaaaaggttttggatgc aggtatcaga actggtgatt 7500 taggtggttc caacagtacc acggaagtcggtgatgctgt cgccgaagaa gttaagaaaa 7560 tccttgctta aaaagattct ctttttttatgatatttgta cataaacttt ataaatgaaa 7620 ttcataatag aaacgacacg aaattacaaaatggaatatg ttcatagggt agacgaaact 7680 atatacgcaa tctacataca tttatcaagaaggagaaaaa ggaggatgta aaggaataca 7740 ggtaagcaaa ttgatactaa tggctcaacgtgataaggaa aaagaattgc actttaacat 7800 taatattgac aaggaggagg gcaccacacaaaaagttagg tgtaacagaa aatcatgaaa 7860 ctatgattcc taatttatat attggaggattttctctaaa aaaaaaaaaa 
tacaacaaat 7920 aaaaaacact caatgacctg accatttgatggagtttaag tcaatacctt cttgaaccat 7980 ttcccataat ggtgaaagtt ccctcaagaattttactctg tcagaaacgg ccttaacgac 8040 gtagtcgacc tcctcttcag tactaaatctaccaatacca aatctgatgg aagaatgggc 8100 taatgcatca tccttaccca gcgcatgtaaaacataagaa ggttctaggg aagcagatgt 8160 acaggctgaa cccgaggata atgcgatatcccttagtgcc atcaataaag attctccttc 8220 cacgtaggcg aaagaaacgt taacacaccctggataacga tgatctggag atccgttcaa 8280 cgtggtatgt tcagcggata atagacctttgactaattta tcggatagtc ttttgatgtg 8340 agcttggtcg ttgtcaaatt ctttcttcatcaatctcgca gcttcaccaa atcccgctac 8400 caatgggggg gccaaagtac cagatctcaatcctctctct tggccaccac cggatagtaa 8460 aggttctaat ctaactcttg gtctccttcttacatagatg gcacctattc cctttggacc 8520 gtaaatcttg tgagaagaaa ttgatagtaaatcaatgttc atttcattga catcaatgtg 8580 aatcttacca taggcttgtg cggcgtcagtatgaaagtag atcttattct ttctacaaat 8640 tgcaccaatt tctttaatag gttgaatgacaccgatttca ttattgacag ccatcacaga 8700 gacgagacag gtatctggtc taatggcatcttccaattcc ttcaaatcga taagaccttg 8760 atcgtccaca tttaggaaag tgacttcaaatccctccttc atcatggccc gtgcggcttc 8820 caagacacac ttgtgttccg ttctagtggtgatgatgtgt ttcttagtct tcttataaaa 8880 tcttgggaca cccttaagaa ccatattattagattcggtc gctcccgaag tgaatattat 8940 ttccttgggg tcggcattga tcatctttgctacgtaagct ctagcatttt ccacagcagt 9000 atttgtttcc caaccgtaag agtgagtgttggaatgagga ttaccataaa gtcccgtata 9060 aaacttcaac atcgtatcca aaaccctagggtctgttggt gtagtggctt gcatgtcaag 9120 atatatggga cgagtaccaa aacctgtgttttcttgataa gcatggctca ttgcagtgct 9180 accagaagct actacagcat ctggggtggtaccggatgca ctcgcacggg cactagcctg 9240 tgcctttgca gcagcctgaa tatcggtatgcgtttccaga gagaagttgt cgtctaactt 9300 cacgcctgct gcagtctcaa tgatattcgaatacgctttg aggagataca gcctaatatc 9360 cgacaaactg ttttacagat ttacgatcgtacttgttacc catcattgaa ttttgaacat 9420 ccgaacctgg gagttttccc tgaaacagatagtatatttg aacctgtata ataatatata 9480 gtctagcgct ttacggaaga caatgtatgtatttcggttc ctggagaaac tattgcatct 9540 attgcatagg taatcttgca cgtcgcatccccggttcatt ttctgcgttt ccatcttgca 9600 cttcaatagc atatctttgt 
taacgaagcatctgtgcttc attttgtaga acaaaaatgc 9660 aacgcgagag cgctaatttt tcaaacaaagaatctgagct gcatttttac agaacagaaa 9720 tgcaacgcga aagcgctatt ttaccaacgaagaatctgtg cttcattttt gtaaaacaaa 9780 aatgcaacgc gagagcgcta atttttcaaacaaagaatct gagctgcatt tttacagaac 9840 agaaatgcaa cgcgagagcg ctattttaccaacaaagaat ctatacttct tttttgttct 9900 acaaaaatgc atcccgagag cgctatttttctaacaaagc atcttagatt actttttttc 9960 tcctttgtgc gctctataat gcagtctcttgataactttt tgcactgtag gtccgttaag 10020 gttagaagaa ggctactttg gtgtctattttctcttccat aaaaaaagcc tgactccact 10080 tcccgcgttt actgattact agcgaagctgcgggtgcatt ttttcaagat aaaggcatcc 10140 ccgattatat tctataccga tgtggattgcgcatactttg tgaacagaaa gtgatagcgt 10200 tgatgattct tcattggtca gaaaattatgaacggtttct tctattttgt ctctatatac 10260 tacgtatagg aaatgtttac attttcgtattgttttcgat tcactctatg aatagttctt 10320 actacaattt ttttgtctaa agagtaatactagagataaa cataaaaaat gtagaggtcg 10380 agtttagatg caagttcaag gagcgaaaggtggatgggta ggttatatag ggatatagca 10440 cagagatata tagcaaagag atacttttgagcaatgtttg tggaagcggt attcgcaata 10500 ttttagtagc tcgttacagt ccggtgcgtttttggttttt tgaaagtgcg tcttcagagc 10560 gcttttggtt ttcaaaagcg ctctgaagttcctatacttt ctagagaata ggaacttcgg 10620 aataggaact tcaaagcgtt tccgaaaacgagcgcttccg aaaatgcaac gcgagctgcg 10680 cacatacagc tcactgttca cgtcgcacctatatctgcgt gttgcctgta tatatatata 10740 catgagaaga acggcatagt gcgtgtttatgcttaaatgc gtacttatat gcgtctattt 10800 atgtaggatg aaaggtagtc tagtacctcctgtgatatta tcccattcca tgcggggtat 10860 cgtatgcttc cttcagcact accctttagctgttctatat gctgccactc ctcaattgga 10920 ttagtctcat ccttcaatgc tatcatttcctttgatattc gatcctaggc atagtaccga 10980 gaaactagtg cgaagtagtg atcaggtattgctgttatct gatgagtata cgttgtcctg 11040 gccacggcag aagcacgctt atcgctccaatttcccacaa cattagtcaa ctccgttagg 11100 cccttcattg aaagaaatga ggtcatcaaatgtcttccaa tgtgagattt tgggccattt 11160 tttatagcaa agattgaata aggcgcatttttcttcaa 11198 24 11427 DNA Saccharomyces cerevisiae 24 agcttcgcggccgcctttcg attagcacgc acacacatca catagactgc gtcataaaaa 60 tacactacggaaaaaccata 
aagagcaaag cgatacctac ttggaaggaa aaggagcacg 120 cttgtaagggggatgggggc taagaagtca ttcactttct tttcccttcg cggtccggac 180 ccgggacccctcctctcccc gcacgatttc ttcctttcat atcttccttt tattcctatc 240 ccgttgaagcaaccgcacta tgactaaatg gtgctggaca tctccatggc tgtgacttgt 300 gtgtatctcacagtggtaac ggcaccgtgg ctcggaaacg gttccttcgt gacaattcta 360 gaacaggggctacagtctcg ataatagaat aataagcgca tttttgctag cgccgccgcg 420 gcgcccgtttcccaataggg aggcgcagtt tatcggcgga gctctacttc ttcctatttg 480 ggtaagcccctttctgtttt cggccagtgg ttgctgcagg ctgcgccgga gaacatagtg 540 ataagggatgtaactttcga tgagagaatt agcaagcgga aaaaaactat ggctagctgg 600 gagttgtttttcaatcatat aaaagggaga aattgttgct cactatgtga cagtttctgg 660 gacgtcttaacttttattgc agaggactat caaatcatac agatattgtc aaaaaaaaaa 720 aagactaataataacatatg cggtccggat ccagtttaaa cagtagcttt ggacttcttc 780 gccagaggtttggtcaagtc tccaatcaag gttgtcggct tgtctacctt gccagaaatt 840 tacgaaaagatggaaaaggg tcaaatcgtt ggtagatacg ttgttgacac ttctaaataa 900 gcgaatttcttatgatttat gatttttatt attaaataag ttataaaaaa aataagtgta 960 tacaaattttaaagtgactc ttaggtttta aaacgaaaat tcttgttctt gagtaactct 1020 ttcctgtaggtcaggttgct ttctcaggta tagcatgagg tcgctcttat tgaccacacc 1080 tctaccggcatgccgagcaa atgcctgcaa atcgctcccc atttcaccca attgtagata 1140 tgctaactccagcaatgagt tgatgaatct cggtgtgtat tttatgtcct cagaagacaa 1200 cacctgttgtaatcgttctt ccacacggat cgcggccgct tgatcctcta cgccggacgc 1260 atcgtggccggcatcaccgg cgccacaggt gcggttgctg gcgcctatat cgccgacatc 1320 accgatggggaagatcgggc tcgccacttc gggctcatga gcgcttgttt cggcgtgggt 1380 atggtggcaggccccgtggc cgggggactg ttgggcgcca tctccttgca tgcaccattc 1440 cttgcggcggcggtgctcaa cggcctcaac ctactactgg gctgcttcct aatgcaggag 1500 tcgcataagggagagcgtcg accgatgccc ttgagagcct tcaacccagt cagctccttc 1560 cggtgggcgcggggcatgac tatcgtcgcc gcacttatga ctgtcttctt tatcatgcaa 1620 ctcgtaggacaggtgccggc agcgctctgg gtcattttcg gcgaggaccg ctttcgctgg 1680 agcgcgacgatgatcggcct gtcgcttgcg gtattcggaa tcttgcacgc cctcgctcaa 1740 gccttcgtcactggtcccgc caccaaacgt ttcggcgaga agcaggccat tatcgccggc 1800 
atggcggccgacgcgctggg ctacgtcttg ctggcgttcg cgacgcgagg ctggatggcc 1860 ttccccattatgattcttct cgcttccggc ggcatcggga tgcccgcgtt gcaggccatg 1920 ctgtccaggcaggtagatga cgaccatcag ggacagcttc aaggatcgct cgcggctctt 1980 accagcctaacttcgatcac tggaccgctg atcgtcacgg cgatttatgc cgcctcggcg 2040 agcacatggaacgggttggc atggattgta ggcgccgccc tataccttgt ctgcctcccc 2100 gcgttgcgtcgcggtgcatg gagccgggcc acctcgacct gaatggaagc cggcggcacc 2160 tcgctaacggattcaccact ccaagaattg gagccaatca attcttgcgg agaactgtga 2220 atgcgcaaaccaacccttgg cagaacatat ccatcgcgtc cgccatctcc agcagccgca 2280 cgcggcgcatctcgggcagc gttgggtcct ggccacgggt gcgcatgatc gtgctcctgt 2340 cgttgaggacccggctaggc tggcggggtt gccttactgg ttagcagaat gaatcaccga 2400 tacgcgagcgaacgtgaagc gactgctgct gcaaaacgtc tgcgacctga gcaacaacat 2460 gaatggtcttcggtttccgt gtttcgtaaa gtctggaaac gcggaagtca gcgccctgca 2520 ccattatgttccggatctgc atcgcaggat gctgctggct accctgtgga acacctacat 2580 ctgtattaacgaagcgctgg cattgaccct gagtgatttt tctctggtcc cgccgcatcc 2640 ataccgccagttgtttaccc tcacaacgtt ccagtaaccg ggcatgttca tcatcagtaa 2700 cccgtatcgtgagcatcctc tctcgtttca tcggtatcat tacccccatg aacagaaatt 2760 cccccttacacggaggcatc aagtgaccaa acaggaaaaa accgccctta acatggcccg 2820 ctttatcagaagccagacat taacgcttct ggagaaactc aacgagctgg acgcggatga 2880 acaggcagacatctgtgaat cgcttcacga ccacgctgat gagctttacc gcagctgcct 2940 cgcgcgtttcggtgatgacg gtgaaaacct ctgacacatg cagctcccgg agacggtcac 3000 agcttgtctgtaagcggatg ccgggagcag acaagcccgt cagggcgcgt cagcgggtgt 3060 tggcgggtgtcggggcgcag ccatgaccca gtcacgtagc gatagcggag tgtatactgg 3120 cttaactatgcggcatcaga gcagattgta ctgagagtgc acgatatccg gtgtgaaata 3180 ccgcacagatgcgtaaggag aaaataccgc atcaggcgct cttccgcttc ctcgctcact 3240 gactcgctgcgctcggtcgt tcggctgcgg cgagcggtat cagctcactc aaaggcggta 3300 atacggttatccacagaatc aggggataac gcaggaaaga acatgtgagc aaaaggccag 3360 caaaaggccaggaaccgtaa aaaggccgcg ttgctggcgt ttttccatag gctccgcccc 3420 cctgacgagcatcacaaaaa tcgacgctca agtcagaggt ggcgaaaccc gacaggacta 3480 taaagataccaggcgtttcc ccctggaagc 
tccctcgtgc gctctcctgt tccgaccctg 3540 ccgcttaccggatacctgtc cgcctttctc ccttcgggaa gcgtggcgct ttctcaatgc 3600 tcacgctgtaggtatctcag ttcggtgtag gtcgttcgct ccaagctggg ctgtgtgcac 3660 gaaccccccgttcagcccga ccgctgcgcc ttatccggta actatcgtct tgagtccaac 3720 ccggtaagacacgacttatc gccactggca gcagccactg gtaacaggat tagcagagcg 3780 aggtatgtaggcggtgctac agagttcttg aagtggtggc ctaactacgg ctacactaga 3840 aggacagtatttggtatctg cgctctgctg aagccagtta ccttcggaaa aagagttggt 3900 agctcttgatccggcaaaca aaccaccgct ggtagcggtg gtttttttgt ttgcaagcag 3960 cagattacgcgcagaaaaaa aggatctcaa gaagatcctt tgatcttttc tacggggtct 4020 gacgctcagtggaacgaaaa ctcacgttaa gggattttgg tcatgagatt atcaaaaagg 4080 atcttcacctagatcctttt aaattaaaaa tgaagtttta aatcaatcta aagtatatat 4140 gagtaaacttggtctgacag ttaccaatgc ttaatcagtg aggcacctat ctcagcgatc 4200 tgtctatttcgttcatccat agttgcctga ctccccgtcg tgtagataac tacgatacgg 4260 gagggcttaccatctggccc cagtgctgca atgataccgc gagacccacg ctcaccggct 4320 ccagatttatcagcaataaa ccagccagcc ggaagggccg agcgcagaag tggtcctgca 4380 actttatccgcctccatcca gtctattaat tgttgccggg aagctagagt aagtagttcg 4440 ccagttaatagtttgcgcaa cgttgttgcc attgctgcag gcatcgtggt gtcacgctcg 4500 tcgtttggtatggcttcatt cagctccggt tcccaacgat caaggcgagt tacatgatcc 4560 cccatgttgtgcaaaaaagc ggttagctcc ttcggtcctc cgatcgttgt cagaagtaag 4620 ttggccgcagtgttatcact catggttatg gcagcactgc ataattctct tactgtcatg 4680 ccatccgtaagatgcttttc tgtgactggt gagtactcaa ccaagtcatt ctgagaatag 4740 tgtatgcggcgaccgagttg ctcttgcccg gcgtcaacac gggataatac cgcgccacat 4800 agcagaactttaaaagtgct catcattgga aaacgttctt cggggcgaaa actctcaagg 4860 atcttaccgctgttgagatc cagttcgatg taacccactc gtgcacccaa ctgatcttca 4920 gcatcttttactttcaccag cgtttctggg tgagcaaaaa caggaaggca aaatgccgca 4980 aaaaagggaataagggcgac acggaaatgt tgaatactca tactcttcct ttttcaatat 5040 tattgaagcatttatcaggg ttattgtctc atgagcggat acatatttga atgtatttag 5100 aaaaataaacaaataggggt tccgcgcaca tttccccgaa aagtgccacc tgacgtctaa 5160 gaaaccattattatcatgac attaacctat aaaaataggc gtatcacgag gccctttcgt 5220 
cttcaagaattccacggact atagactata ctagtatact ccgtctactg tacgatacac 5280 ttccgctcaggtccttgtcc tttaacgagg ccttaccact cttttgttac tctattgatc 5340 cagctcagcaaaggcagtgt gatctaagat tctatcttcg cgatgtagta aaactagcta 5400 gaccgagaaagagactagaa atgcaaaagg cacttctaca atggctgcca tcattattat 5460 ccgatgtgacgctgcagaag cagaaataca cgcggtcagt gaagctattc cgctattgaa 5520 taacctcagtcaccttgtgc aagaacttaa caagaaacca attattaaag gcttacttac 5580 tgatagtagatcaacgatca gtataattaa gtctacaaat gaagagaaat ttagaaacag 5640 attttttggcacaaaggcaa tgagacttag agatgaagta tcaggtaata atttatacgt 5700 atactacatcgagaccaaga agaacattgc tgatgtgatg acaaaacctc ttccgataaa 5760 aacatttaaactattaacta acaaatggat tcattagatc tattacatta tgggtggtat 5820 gttggaataaaaatcaacta tcatctacta actagtattt acgttactag tatattatca 5880 tatacggtgttagaagatga cgcaaatgat gagaaatagt catctaaatt agtggaagct 5940 gaaacgcaaggattgataat gtaataggat caatgaatat taacatataa aatgatgata 6000 ataatatttatagaattgtg tagaattgca gattcccttt tatggattcc taaatcctcg 6060 aggagaacttctagtatatc tacataccta atattattgc cttattaaaa atggaatccc 6120 aacaattacatcaaaatcca cattctcttc aaaatcaatt gtcctgtact tccttgttca 6180 tgtgtgttcaaaaacgttat atttatagga taattatact ctatttctca acaagtaatt 6240 ggttgtttggccgagcggtc taaggcgcct gattcaagaa atatcttgac cgcagttaac 6300 tgtgggaatactcaggtatc gtaagatgca agagttcgaa tctcttagca accattattt 6360 ttttcctcaacataacgaga acacacaggg gcgctatcgc acagaatcaa attcgatgac 6420 tggaaattttttgttaattt cagaggtcgc ctgacgcata tacctttttc aactgaaaaa 6480 ttgggagaaaaaggaaaggt gagagccgcg gaaccggctt ttcatataga atagagaagc 6540 gttcatgactaaatgcttgc atcacaatac ttgaagttga caatattatt taaggaccta 6600 ttgttttttccaataggtgg ttagcaatcg tcttactttc taacttttct taccttttac 6660 atttcagcaatatatatata tatatttcaa ggatatacca ttctaatgtc tgcccctaag 6720 aagatcgtcgttttgccagg tgaccacgtt ggtcaagaaa tcacagccga agccattaag 6780 gttcttaaagctatttctga tgttcgttcc aatgtcaagt tcgatttcga aaatcattta 6840 attggtggtgctgctatcga tgctacaggt gtcccacttc cagatgaggc gctggaagcc 6900 tccaagaaggttgatgccgt tttgttaggt 
gctgtgggtg gtcctaaatg gggtaccggt 6960 agtgttagacctgaacaagg tttactaaaa atccgtaaag aacttcaatt gtacgccaac 7020 ttaagaccatgtaactttgc atccgactct cttttagact tatctccaat caagccacaa 7080 tttgctaaaggtactgactt cgttgttgtc agagaattag tgggaggtat ttactttggt 7140 aagagaaaggaagacgatgg tgatggtgtc gcttgggata gtgaacaata caccgttcca 7200 gaagtgcaaagaatcacaag aatggccgct ttcatggccc tacaacatga gccaccattg 7260 cctatttggtccttggataa agctaatgtt ttggcctctt caagattatg gagaaaaact 7320 gtggaggaaaccatcaagaa cgaattccct acattgaagg ttcaacatca attgattgat 7380 tctgccgccatgatcctagt taagaaccca acccacctaa atggtattat aatcaccagc 7440 aacatgtttggtgatatcat ctccgatgaa gcctccgtta tcccaggttc cttgggtttg 7500 ttgccatctgcgtccttggc ctctttgcca gacaagaaca ccgcatttgg tttgtacgaa 7560 ccatgccacggttctgctcc agatttgcca aagaataagg tcaaccctat cgccactatc 7620 ttgtctgctgcaatgatgtt gaaattgtca ttgaacttgc ctgaagaagg taaggccatt 7680 gaagatgcagttaaaaaggt tttggatgca ggtatcagaa ctggtgattt aggtggttcc 7740 aacagtaccacggaagtcgg tgatgctgtc gccgaagaag ttaagaaaat ccttgcttaa 7800 aaagattctctttttttatg atatttgtac ataaacttta taaatgaaat tcataataga 7860 aacgacacgaaattacaaaa tggaatatgt tcatagggta gacgaaacta tatacgcaat 7920 ctacatacatttatcaagaa ggagaaaaag gaggatgtaa aggaatacag gtaagcaaat 7980 tgatactaatggctcaacgt gataaggaaa aagaattgca ctttaacatt aatattgaca 8040 aggaggagggcaccacacaa aaagttaggt gtaacagaaa atcatgaaac tatgattcct 8100 aatttatatattggaggatt ttctctaaaa aaaaaaaaat acaacaaata aaaaacactc 8160 aatgacctgaccatttgatg gagtttaagt caataccttc ttgaaccatt tcccataatg 8220 gtgaaagttccctcaagaat tttactctgt cagaaacggc cttaacgacg tagtcgacct 8280 cctcttcagtactaaatcta ccaataccaa atctgatgga agaatgggct aatgcatcat 8340 ccttacccagcgcatgtaaa acataagaag gttctaggga agcagatgta caggctgaac 8400 ccgaggataatgcgatatcc cttagtgcca tcaataaaga ttctccttcc acgtaggcga 8460 aagaaacgttaacacaccct ggataacgat gatctggaga tccgttcaac gtggtatgtt 8520 cagcggataatagacctttg actaatttat cggatagtct tttgatgtga gcttggtcgt 8580 tgtcaaattctttcttcatc aatctcgcag cttcaccaaa tcccgctacc aatggggggg 8640 
ccaaagtaccagatctcaat cctctctctt ggccaccacc ggatagtaaa ggttctaatc 8700 taactcttggtctccttctt acatagatgg cacctattcc ctttggaccg taaatcttgt 8760 gagaagaaattgatagtaaa tcaatgttca tttcattgac atcaatgtga atcttaccat 8820 aggcttgtgcggcgtcagta tgaaagtaga tcttattctt tctacaaatt gcaccaattt 8880 ctttaataggttgaatgaca ccgatttcat tattgacagc catcacagag acgagacagg 8940 tatctggtctaatggcatct tccaattcct tcaaatcgat aagaccttga tcgtccacat 9000 ttaggaaagtgacttcaaat ccctccttca tcatggcccg tgcggcttcc aagacacact 9060 tgtgttccgttctagtggtg atgatgtgtt tcttagtctt cttataaaat cttgggacac 9120 ccttaagaaccatattatta gattcggtcg ctcccgaagt gaatattatt tccttggggt 9180 cggcattgatcatctttgct acgtaagctc tagcattttc cacagcagta tttgtttccc 9240 aaccgtaagagtgagtgttg gaatgaggat taccataaag tcccgtataa aacttcaaca 9300 tcgtatccaaaaccctaggg tctgttggtg tagtggcttg catgtcaaga tatatgggac 9360 gagtaccaaaacctgtgttt tcttgataag catggctcat tgcagtgcta ccagaagcta 9420 ctacagcatctggggtggta ccggatgcac tcgcacgggc actagcctgt gcctttgcag 9480 cagcctgaatatcggtatgc gtttccagag agaagttgtc gtctaacttc acgcctgctg 9540 cagtctcaatgatattcgaa tacgctttga ggagatacag cctaatatcc gacaaactgt 9600 tttacagatttacgatcgta cttgttaccc atcattgaat tttgaacatc cgaacctggg 9660 agttttccctgaaacagata gtatatttga acctgtataa taatatatag tctagcgctt 9720 tacggaagacaatgtatgta tttcggttcc tggagaaact attgcatcta ttgcataggt 9780 aatcttgcacgtcgcatccc cggttcattt tctgcgtttc catcttgcac ttcaatagca 9840 tatctttgttaacgaagcat ctgtgcttca ttttgtagaa caaaaatgca acgcgagagc 9900 gctaatttttcaaacaaaga atctgagctg catttttaca gaacagaaat gcaacgcgaa 9960 agcgctattttaccaacgaa gaatctgtgc ttcatttttg taaaacaaaa atgcaacgcg 10020 agagcgctaatttttcaaac aaagaatctg agctgcattt ttacagaaca gaaatgcaac 10080 gcgagagcgctattttacca acaaagaatc tatacttctt ttttgttcta caaaaatgca 10140 tcccgagagcgctatttttc taacaaagca tcttagatta ctttttttct cctttgtgcg 10200 ctctataatgcagtctcttg ataacttttt gcactgtagg tccgttaagg ttagaagaag 10260 gctactttggtgtctatttt ctcttccata aaaaaagcct gactccactt cccgcgttta 10320 ctgattactagcgaagctgc gggtgcattt 
tttcaagata aaggcatccc cgattatatt 10380 ctataccgatgtggattgcg catactttgt gaacagaaag tgatagcgtt gatgattctt 10440 cattggtcagaaaattatga acggtttctt ctattttgtc tctatatact acgtatagga 10500 aatgtttacattttcgtatt gttttcgatt cactctatga atagttctta ctacaatttt 10560 tttgtctaaagagtaatact agagataaac ataaaaaatg tagaggtcga gtttagatgc 10620 aagttcaaggagcgaaaggt ggatgggtag gttatatagg gatatagcac agagatatat 10680 agcaaagagatacttttgag caatgtttgt ggaagcggta ttcgcaatat tttagtagct 10740 cgttacagtccggtgcgttt ttggtttttt gaaagtgcgt cttcagagcg cttttggttt 10800 tcaaaagcgctctgaagttc ctatactttc tagagaatag gaacttcgga ataggaactt 10860 caaagcgtttccgaaaacga gcgcttccga aaatgcaacg cgagctgcgc acatacagct 10920 cactgttcacgtcgcaccta tatctgcgtg ttgcctgtat atatatatac atgagaagaa 10980 cggcatagtgcgtgtttatg cttaaatgcg tacttatatg cgtctattta tgtaggatga 11040 aaggtagtctagtacctcct gtgatattat cccattccat gcggggtatc gtatgcttcc 11100 ttcagcactaccctttagct gttctatatg ctgccactcc tcaattggat tagtctcatc 11160 cttcaatgctatcatttcct ttgatattcg atcctaggca tagtaccgag aaactagtgc 11220 gaagtagtgatcaggtattg ctgttatctg atgagtatac gttgtcctgg ccacggcaga 11280 agcacgcttatcgctccaat ttcccacaac attagtcaac tccgttaggc ccttcattga 11340 aagaaatgaggtcatcaaat gtcttccaat gtgagatttt gggccatttt ttatagcaaa 11400 gattgaataaggcgcatttt tcttcaa 11427 25 11201 DNA Saccharomyces cerevisiae 25aagcttcgcg gccgcgcaga aatgatgaag ggtgttagcg ccgtccactg atgtgcctgg 60tagtcatgat ttacgtataa ctaacacatc atgaggacgg cggcgtcacc ccaacgcaaa 120agagtgactt ccctgcgctt tgccaaaacc ccatacatcg ccatctggct cctggcaggg 180cggttgatgg acatcagccg cctcccttaa ttgctaaagc ctccacaagg cacaattaag 240caatatttcg ggaaagtaca ccagtcagtt tgcgctttta tgactgggtt ctaaggtact 300agatgtgaag tagtggtgac agaatcaggg agataagagg gagcagggtg gggtaatgat 360gtgcgataac aatcttgctt ggctaatcac ccccatatct tgtagtgagt atataaatag 420gagcctccct tcctattgca actccataaa attttttttt gtagccactt ctgtaacaag 480ataaataaaa ccaactaatc gagatatcac atatgcggtc cggatccagt ttaaacagta 540gctttggact tcttcgccag aggtttggtc aagtctccaa tcaaggttgt 
cggcttgtct 600accttgccag aaatttacga aaagatggaa aagggtcaaa tcgttggtag atacgttgtt 660gacacttcta aataagcgaa tttcttatga tttatgattt ttattattaa ataagttata 720aaaaaaataa gtgtatacaa attttaaagt gactcttagg ttttaaaacg aaaattcttg 780ttcttgagta actctttcct gtaggtcagg ttgctttctc aggtatagca tgaggtcgct 840cttattgacc acacctctac cggcatgccg agcaaatgcc tgcaaatcgc tccccatttc 900acccaattgt agatatgcta actccagcaa tgagttgatg aatctcggtg tgtattttat 960gtcctcagaa gacaacacct gttgtaatcg ttcttccaca cggatcgcgg ccgcttgatc 1020ctctacgccg gacgcatcgt ggccggcatc accggcgcca caggtgcggt tgctggcgcc 1080tatatcgccg acatcaccga tggggaagat cgggctcgcc acttcgggct catgagcgct 1140tgtttcggcg tgggtatggt ggcaggcccc gtggccgggg gactgttggg cgccatctcc 1200ttgcatgcac cattccttgc ggcggcggtg ctcaacggcc tcaacctact actgggctgc 1260ttcctaatgc aggagtcgca taagggagag cgtcgaccga tgcccttgag agccttcaac 1320ccagtcagct ccttccggtg ggcgcggggc atgactatcg tcgccgcact tatgactgtc 1380ttctttatca tgcaactcgt aggacaggtg ccggcagcgc tctgggtcat tttcggcgag 1440gaccgctttc gctggagcgc gacgatgatc ggcctgtcgc ttgcggtatt cggaatcttg 1500cacgccctcg ctcaagcctt cgtcactggt cccgccacca aacgtttcgg cgagaagcag 1560gccattatcg ccggcatggc ggccgacgcg ctgggctacg tcttgctggc gttcgcgacg 1620cgaggctgga tggccttccc cattatgatt cttctcgctt ccggcggcat cgggatgccc 1680gcgttgcagg ccatgctgtc caggcaggta gatgacgacc atcagggaca gcttcaagga 1740tcgctcgcgg ctcttaccag cctaacttcg atcactggac cgctgatcgt cacggcgatt 1800tatgccgcct cggcgagcac atggaacggg ttggcatgga ttgtaggcgc cgccctatac 1860cttgtctgcc tccccgcgtt gcgtcgcggt gcatggagcc gggccacctc gacctgaatg 1920gaagccggcg gcacctcgct aacggattca ccactccaag aattggagcc aatcaattct 1980tgcggagaac tgtgaatgcg caaaccaacc cttggcagaa catatccatc gcgtccgcca 2040tctccagcag ccgcacgcgg cgcatctcgg gcagcgttgg gtcctggcca cgggtgcgca 2100tgatcgtgct cctgtcgttg aggacccggc taggctggcg gggttgcctt actggttagc 2160agaatgaatc accgatacgc gagcgaacgt gaagcgactg ctgctgcaaa acgtctgcga 2220cctgagcaac aacatgaatg gtcttcggtt tccgtgtttc gtaaagtctg gaaacgcgga 2280agtcagcgcc ctgcaccatt 
atgttccgga tctgcatcgc aggatgctgc tggctaccct 2340gtggaacacc tacatctgta ttaacgaagc gctggcattg accctgagtg atttttctct 2400ggtcccgccg catccatacc gccagttgtt taccctcaca acgttccagt aaccgggcat 2460gttcatcatc agtaacccgt atcgtgagca tcctctctcg tttcatcggt atcattaccc 2520ccatgaacag aaattccccc ttacacggag gcatcaagtg accaaacagg aaaaaaccgc 2580ccttaacatg gcccgcttta tcagaagcca gacattaacg cttctggaga aactcaacga 2640gctggacgcg gatgaacagg cagacatctg tgaatcgctt cacgaccacg ctgatgagct 2700ttaccgcagc tgcctcgcgc gtttcggtga tgacggtgaa aacctctgac acatgcagct 2760cccggagacg gtcacagctt gtctgtaagc ggatgccggg agcagacaag cccgtcaggg 2820cgcgtcagcg ggtgttggcg ggtgtcgggg cgcagccatg acccagtcac gtagcgatag 2880cggagtgtat actggcttaa ctatgcggca tcagagcaga ttgtactgag agtgcacgat 2940atccggtgtg aaataccgca cagatgcgta aggagaaaat accgcatcag gcgctcttcc 3000gcttcctcgc tcactgactc gctgcgctcg gtcgttcggc tgcggcgagc ggtatcagct 3060cactcaaagg cggtaatacg gttatccaca gaatcagggg ataacgcagg aaagaacatg 3120tgagcaaaag gccagcaaaa ggccaggaac cgtaaaaagg ccgcgttgct ggcgtttttc 3180cataggctcc gcccccctga cgagcatcac aaaaatcgac gctcaagtca gaggtggcga 3240aacccgacag gactataaag ataccaggcg tttccccctg gaagctccct cgtgcgctct 3300cctgttccga ccctgccgct taccggatac ctgtccgcct ttctcccttc gggaagcgtg 3360gcgctttctc aatgctcacg ctgtaggtat ctcagttcgg tgtaggtcgt tcgctccaag 3420ctgggctgtg tgcacgaacc ccccgttcag cccgaccgct gcgccttatc cggtaactat 3480cgtcttgagt ccaacccggt aagacacgac ttatcgccac tggcagcagc cactggtaac 3540aggattagca gagcgaggta tgtaggcggt gctacagagt tcttgaagtg gtggcctaac 3600tacggctaca ctagaaggac agtatttggt atctgcgctc tgctgaagcc agttaccttc 3660ggaaaaagag ttggtagctc ttgatccggc aaacaaacca ccgctggtag cggtggtttt 3720tttgtttgca agcagcagat tacgcgcaga aaaaaaggat ctcaagaaga tcctttgatc 3780ttttctacgg ggtctgacgc tcagtggaac gaaaactcac gttaagggat tttggtcatg 3840agattatcaa aaaggatctt cacctagatc cttttaaatt aaaaatgaag ttttaaatca 3900atctaaagta tatatgagta aacttggtct gacagttacc aatgcttaat cagtgaggca 3960cctatctcag cgatctgtct atttcgttca tccatagttg cctgactccc 
cgtcgtgtag 4020ataactacga tacgggaggg cttaccatct ggccccagtg ctgcaatgat accgcgagac 4080ccacgctcac cggctccaga tttatcagca ataaaccagc cagccggaag ggccgagcgc 4140agaagtggtc ctgcaacttt atccgcctcc atccagtcta ttaattgttg ccgggaagct 4200agagtaagta gttcgccagt taatagtttg cgcaacgttg ttgccattgc tgcaggcatc 4260gtggtgtcac gctcgtcgtt tggtatggct tcattcagct ccggttccca acgatcaagg 4320cgagttacat gatcccccat gttgtgcaaa aaagcggtta gctccttcgg tcctccgatc 4380gttgtcagaa gtaagttggc cgcagtgtta tcactcatgg ttatggcagc actgcataat 4440tctcttactg tcatgccatc cgtaagatgc ttttctgtga ctggtgagta ctcaaccaag 4500tcattctgag aatagtgtat gcggcgaccg agttgctctt gcccggcgtc aacacgggat 4560aataccgcgc cacatagcag aactttaaaa gtgctcatca ttggaaaacg ttcttcgggg 4620cgaaaactct caaggatctt accgctgttg agatccagtt cgatgtaacc cactcgtgca 4680cccaactgat cttcagcatc ttttactttc accagcgttt ctgggtgagc aaaaacagga 4740aggcaaaatg ccgcaaaaaa gggaataagg gcgacacgga aatgttgaat actcatactc 4800ttcctttttc aatattattg aagcatttat cagggttatt gtctcatgag cggatacata 4860tttgaatgta tttagaaaaa taaacaaata ggggttccgc gcacatttcc ccgaaaagtg 4920ccacctgacg tctaagaaac cattattatc atgacattaa cctataaaaa taggcgtatc 4980acgaggccct ttcgtcttca agaattccac ggactataga ctatactagt atactccgtc 5040tactgtacga tacacttccg ctcaggtcct tgtcctttaa cgaggcctta ccactctttt 5100gttactctat tgatccagct cagcaaaggc agtgtgatct aagattctat cttcgcgatg 5160tagtaaaact agctagaccg agaaagagac tagaaatgca aaaggcactt ctacaatggc 5220tgccatcatt attatccgat gtgacgctgc agaagcagaa atacacgcgg tcagtgaagc 5280tattccgcta ttgaataacc tcagtcacct tgtgcaagaa cttaacaaga aaccaattat 5340taaaggctta cttactgata gtagatcaac gatcagtata attaagtcta caaatgaaga 5400gaaatttaga aacagatttt ttggcacaaa ggcaatgaga cttagagatg aagtatcagg 5460taataattta tacgtatact acatcgagac caagaagaac attgctgatg tgatgacaaa 5520acctcttccg ataaaaacat ttaaactatt aactaacaaa tggattcatt agatctatta 5580cattatgggt ggtatgttgg aataaaaatc aactatcatc tactaactag tatttacgtt 5640actagtatat tatcatatac ggtgttagaa gatgacgcaa atgatgagaa atagtcatct 5700aaattagtgg aagctgaaac 
gcaaggattg ataatgtaat aggatcaatg aatattaaca 5760tataaaatga tgataataat atttatagaa ttgtgtagaa ttgcagattc ccttttatgg 5820attcctaaat cctcgaggag aacttctagt atatctacat acctaatatt attgccttat 5880taaaaatgga atcccaacaa ttacatcaaa atccacattc tcttcaaaat caattgtcct 5940gtacttcctt gttcatgtgt gttcaaaaac gttatattta taggataatt atactctatt 6000tctcaacaag taattggttg tttggccgag cggtctaagg cgcctgattc aagaaatatc 6060ttgaccgcag ttaactgtgg gaatactcag gtatcgtaag atgcaagagt tcgaatctct 6120tagcaaccat tatttttttc ctcaacataa cgagaacaca caggggcgct atcgcacaga 6180atcaaattcg atgactggaa attttttgtt aatttcagag gtcgcctgac gcatatacct 6240ttttcaactg aaaaattggg agaaaaagga aaggtgagag ccgcggaacc ggcttttcat 6300atagaataga gaagcgttca tgactaaatg cttgcatcac aatacttgaa gttgacaata 6360ttatttaagg acctattgtt ttttccaata ggtggttagc aatcgtctta ctttctaact 6420tttcttacct tttacatttc agcaatatat atatatatat ttcaaggata taccattcta 6480atgtctgccc ctaagaagat cgtcgttttg ccaggtgacc acgttggtca agaaatcaca 6540gccgaagcca ttaaggttct taaagctatt tctgatgttc gttccaatgt caagttcgat 6600ttcgaaaatc atttaattgg tggtgctgct atcgatgcta caggtgtccc acttccagat 6660gaggcgctgg aagcctccaa gaaggttgat gccgttttgt taggtgctgt gggtggtcct 6720aaatggggta ccggtagtgt tagacctgaa caaggtttac taaaaatccg taaagaactt 6780caattgtacg ccaacttaag accatgtaac tttgcatccg actctctttt agacttatct 6840ccaatcaagc cacaatttgc taaaggtact gacttcgttg ttgtcagaga attagtggga 6900ggtatttact ttggtaagag aaaggaagac gatggtgatg gtgtcgcttg ggatagtgaa 6960caatacaccg ttccagaagt gcaaagaatc acaagaatgg ccgctttcat ggccctacaa 7020catgagccac cattgcctat ttggtccttg gataaagcta atgttttggc ctcttcaaga 7080ttatggagaa aaactgtgga ggaaaccatc aagaacgaat tccctacatt gaaggttcaa 7140catcaattga ttgattctgc cgccatgatc ctagttaaga acccaaccca cctaaatggt 7200attataatca ccagcaacat gtttggtgat atcatctccg atgaagcctc cgttatccca 7260ggttccttgg gtttgttgcc atctgcgtcc ttggcctctt tgccagacaa gaacaccgca 7320tttggtttgt acgaaccatg ccacggttct gctccagatt tgccaaagaa taaggtcaac 7380cctatcgcca ctatcttgtc tgctgcaatg atgttgaaat tgtcattgaa 
cttgcctgaa 7440gaaggtaagg ccattgaaga tgcagttaaa aaggttttgg atgcaggtat cagaactggt 7500gatttaggtg gttccaacag taccacggaa gtcggtgatg ctgtcgccga agaagttaag 7560aaaatccttg cttaaaaaga ttctcttttt ttatgatatt tgtacataaa ctttataaat 7620gaaattcata atagaaacga cacgaaatta caaaatggaa tatgttcata gggtagacga 7680aactatatac gcaatctaca tacatttatc aagaaggaga aaaaggagga tgtaaaggaa 7740tacaggtaag caaattgata ctaatggctc aacgtgataa ggaaaaagaa ttgcacttta 7800acattaatat tgacaaggag gagggcacca cacaaaaagt taggtgtaac agaaaatcat 7860gaaactatga ttcctaattt atatattgga ggattttctc taaaaaaaaa aaaatacaac 7920aaataaaaaa cactcaatga cctgaccatt tgatggagtt taagtcaata ccttcttgaa 7980ccatttccca taatggtgaa agttccctca agaattttac tctgtcagaa acggccttaa 8040cgacgtagtc gacctcctct tcagtactaa atctaccaat accaaatctg atggaagaat 8100gggctaatgc atcatcctta cccagcgcat gtaaaacata agaaggttct agggaagcag 8160atgtacaggc tgaacccgag gataatgcga tatcccttag tgccatcaat aaagattctc 8220cttccacgta ggcgaaagaa acgttaacac accctggata acgatgatct ggagatccgt 8280tcaacgtggt atgttcagcg gataatagac ctttgactaa tttatcggat agtcttttga 8340tgtgagcttg gtcgttgtca aattctttct tcatcaatct cgcagcttca ccaaatcccg 8400ctaccaatgg gggggccaaa gtaccagatc tcaatcctct ctcttggcca ccaccggata 8460gtaaaggttc taatctaact cttggtctcc ttcttacata gatggcacct attccctttg 8520gaccgtaaat cttgtgagaa gaaattgata gtaaatcaat gttcatttca ttgacatcaa 8580tgtgaatctt accataggct tgtgcggcgt cagtatgaaa gtagatctta ttctttctac 8640aaattgcacc aatttcttta ataggttgaa tgacaccgat ttcattattg acagccatca 8700cagagacgag acaggtatct ggtctaatgg catcttccaa ttccttcaaa tcgataagac 8760cttgatcgtc cacatttagg aaagtgactt caaatccctc cttcatcatg gcccgtgcgg 8820cttccaagac acacttgtgt tccgttctag tggtgatgat gtgtttctta gtcttcttat 8880aaaatcttgg gacaccctta agaaccatat tattagattc ggtcgctccc gaagtgaata 8940ttatttcctt ggggtcggca ttgatcatct ttgctacgta agctctagca ttttccacag 9000cagtatttgt ttcccaaccg taagagtgag tgttggaatg aggattacca taaagtcccg 9060tataaaactt caacatcgta tccaaaaccc tagggtctgt tggtgtagtg gcttgcatgt 9120caagatatat gggacgagta 
ccaaaacctg tgttttcttg ataagcatgg ctcattgcag 9180tgctaccaga agctactaca gcatctgggg tggtaccgga tgcactcgca cgggcactag 9240cctgtgcctt tgcagcagcc tgaatatcgg tatgcgtttc cagagagaag ttgtcgtcta 9300acttcacgcc tgctgcagtc tcaatgatat tcgaatacgc tttgaggaga tacagcctaa 9360tatccgacaa actgttttac agatttacga tcgtacttgt tacccatcat tgaattttga 9420acatccgaac ctgggagttt tccctgaaac agatagtata tttgaacctg tataataata 9480tatagtctag cgctttacgg aagacaatgt atgtatttcg gttcctggag aaactattgc 9540atctattgca taggtaatct tgcacgtcgc atccccggtt cattttctgc gtttccatct 9600tgcacttcaa tagcatatct ttgttaacga agcatctgtg cttcattttg tagaacaaaa 9660atgcaacgcg agagcgctaa tttttcaaac aaagaatctg agctgcattt ttacagaaca 9720gaaatgcaac gcgaaagcgc tattttacca acgaagaatc tgtgcttcat ttttgtaaaa 9780caaaaatgca acgcgagagc gctaattttt caaacaaaga atctgagctg catttttaca 9840gaacagaaat gcaacgcgag agcgctattt taccaacaaa gaatctatac ttcttttttg 9900ttctacaaaa atgcatcccg agagcgctat ttttctaaca aagcatctta gattactttt 9960tttctccttt gtgcgctcta taatgcagtc tcttgataac tttttgcact gtaggtccgt 10020taaggttaga agaaggctac tttggtgtct attttctctt ccataaaaaa agcctgactc 10080cacttcccgc gtttactgat tactagcgaa gctgcgggtg cattttttca agataaaggc 10140atccccgatt atattctata ccgatgtgga ttgcgcatac tttgtgaaca gaaagtgata 10200gcgttgatga ttcttcattg gtcagaaaat tatgaacggt ttcttctatt ttgtctctat 10260atactacgta taggaaatgt ttacattttc gtattgtttt cgattcactc tatgaatagt 10320tcttactaca atttttttgt ctaaagagta atactagaga taaacataaa aaatgtagag 10380gtcgagttta gatgcaagtt caaggagcga aaggtggatg ggtaggttat atagggatat 10440agcacagaga tatatagcaa agagatactt ttgagcaatg tttgtggaag cggtattcgc 10500aatattttag tagctcgtta cagtccggtg cgtttttggt tttttgaaag tgcgtcttca 10560gagcgctttt ggttttcaaa agcgctctga agttcctata ctttctagag aataggaact 10620tcggaatagg aacttcaaag cgtttccgaa aacgagcgct tccgaaaatg caacgcgagc 10680tgcgcacata cagctcactg ttcacgtcgc acctatatct gcgtgttgcc tgtatatata 10740tatacatgag aagaacggca tagtgcgtgt ttatgcttaa atgcgtactt atatgcgtct 10800atttatgtag gatgaaaggt agtctagtac ctcctgtgat 
attatcccat tccatgcggg 10860gtatcgtatg cttccttcag cactaccctt tagctgttct atatgctgcc actcctcaat 10920tggattagtc tcatccttca atgctatcat ttcctttgat attcgatcct aggcatagta 10980ccgagaaact agtgcgaagt agtgatcagg tattgctgtt atctgatgag tatacgttgt 11040cctggccacg gcagaagcac gcttatcgct ccaatttccc acaacattag tcaactccgt 11100taggcccttc attgaaagaa atgaggtcat caaatgtctt ccaatgtgag attttgggcc 11160attttttata gcaaagattg aataaggcgc atttttcttc a 11201 26 11204 DNA Saccharomyces cerevisiae 26 aagcttcgcg gccgcggagg tctgcttcac gagcgcggtgtgcgcctagt attgccccga 60 cggtccgggt gcctatccct agatttcgtc gtgccccgacccaaatagtt aaacgtgtgg 120 tttatgggtg caccagggct ttatcgtgtt ttatatcgatggcgatttgt gcctccagtg 180 tatttttgta tatccaatta aggtttctta cctaattttatttttatcat ctttagttaa 240 tgctggtttg ctctgtttct gctgctttct gtgcggttctcctcttctct tgtttcttcg 300 tgttgtcccc catcgccgat gggcttatat ggcgtatatatatagagcga gtttttacgt 360 cgaagatcat ctcagtttgc ttgatagcct ttctactttattactttcgt ttttaacctc 420 attatacttt agttttcttt gatcggtttt tttctctgtatacttaaaag ttcaaatcaa 480 agaaacatac aaaactacgt ttatatcaat tacatatgcggtccggatcc agtttaaaca 540 gtagctttgg acttcttcgc cagaggtttg gtcaagtctccaatcaaggt tgtcggcttg 600 tctaccttgc cagaaattta cgaaaagatg gaaaagggtcaaatcgttgg tagatacgtt 660 gttgacactt ctaaataagc gaatttctta tgatttatgatttttattat taaataagtt 720 ataaaaaaaa taagtgtata caaattttaa agtgactcttaggttttaaa acgaaaattc 780 ttgttcttga gtaactcttt cctgtaggtc aggttgctttctcaggtata gcatgaggtc 840 gctcttattg accacacctc taccggcatg ccgagcaaatgcctgcaaat cgctccccat 900 ttcacccaat tgtagatatg ctaactccag caatgagttgatgaatctcg gtgtgtattt 960 tatgtcctca gaagacaaca cctgttgtaa tcgttcttccacacggatcg cggccgcttg 1020 atcctctacg ccggacgcat cgtggccggc atcaccggcgccacaggtgc ggttgctggc 1080 gcctatatcg ccgacatcac cgatggggaa gatcgggctcgccacttcgg gctcatgagc 1140 gcttgtttcg gcgtgggtat ggtggcaggc cccgtggccgggggactgtt gggcgccatc 1200 tccttgcatg caccattcct tgcggcggcg gtgctcaacggcctcaacct actactgggc 1260 tgcttcctaa tgcaggagtc gcataaggga gagcgtcgaccgatgccctt gagagccttc 1320
aacccagtca gctccttccg gtgggcgcgg ggcatgactatcgtcgccgc acttatgact 1380 gtcttcttta tcatgcaact cgtaggacag gtgccggcagcgctctgggt cattttcggc 1440 gaggaccgct ttcgctggag cgcgacgatg atcggcctgtcgcttgcggt attcggaatc 1500 ttgcacgccc tcgctcaagc cttcgtcact ggtcccgccaccaaacgttt cggcgagaag 1560 caggccatta tcgccggcat ggcggccgac gcgctgggctacgtcttgct ggcgttcgcg 1620 acgcgaggct ggatggcctt ccccattatg attcttctcgcttccggcgg catcgggatg 1680 cccgcgttgc aggccatgct gtccaggcag gtagatgacgaccatcaggg acagcttcaa 1740 ggatcgctcg cggctcttac cagcctaact tcgatcactggaccgctgat cgtcacggcg 1800 atttatgccg cctcggcgag cacatggaac gggttggcatggattgtagg cgccgcccta 1860 taccttgtct gcctccccgc gttgcgtcgc ggtgcatggagccgggccac ctcgacctga 1920 atggaagccg gcggcacctc gctaacggat tcaccactccaagaattgga gccaatcaat 1980 tcttgcggag aactgtgaat gcgcaaacca acccttggcagaacatatcc atcgcgtccg 2040 ccatctccag cagccgcacg cggcgcatct cgggcagcgttgggtcctgg ccacgggtgc 2100 gcatgatcgt gctcctgtcg ttgaggaccc ggctaggctggcggggttgc cttactggtt 2160 agcagaatga atcaccgata cgcgagcgaa cgtgaagcgactgctgctgc aaaacgtctg 2220 cgacctgagc aacaacatga atggtcttcg gtttccgtgtttcgtaaagt ctggaaacgc 2280 ggaagtcagc gccctgcacc attatgttcc ggatctgcatcgcaggatgc tgctggctac 2340 cctgtggaac acctacatct gtattaacga agcgctggcattgaccctga gtgatttttc 2400 tctggtcccg ccgcatccat accgccagtt gtttaccctcacaacgttcc agtaaccggg 2460 catgttcatc atcagtaacc cgtatcgtga gcatcctctctcgtttcatc ggtatcatta 2520 cccccatgaa cagaaattcc cccttacacg gaggcatcaagtgaccaaac aggaaaaaac 2580 cgcccttaac atggcccgct ttatcagaag ccagacattaacgcttctgg agaaactcaa 2640 cgagctggac gcggatgaac aggcagacat ctgtgaatcgcttcacgacc acgctgatga 2700 gctttaccgc agctgcctcg cgcgtttcgg tgatgacggtgaaaacctct gacacatgca 2760 gctcccggag acggtcacag cttgtctgta agcggatgccgggagcagac aagcccgtca 2820 gggcgcgtca gcgggtgttg gcgggtgtcg gggcgcagccatgacccagt cacgtagcga 2880 tagcggagtg tatactggct taactatgcg gcatcagagcagattgtact gagagtgcac 2940 gatatccggt gtgaaatacc gcacagatgc gtaaggagaaaataccgcat caggcgctct 3000 tccgcttcct cgctcactga ctcgctgcgc 
tcggtcgttcggctgcggcg agcggtatca 3060 gctcactcaa aggcggtaat acggttatcc acagaatcaggggataacgc aggaaagaac 3120 atgtgagcaa aaggccagca aaaggccagg aaccgtaaaaaggccgcgtt gctggcgttt 3180 ttccataggc tccgcccccc tgacgagcat cacaaaaatcgacgctcaag tcagaggtgg 3240 cgaaacccga caggactata aagataccag gcgtttccccctggaagctc cctcgtgcgc 3300 tctcctgttc cgaccctgcc gcttaccgga tacctgtccgcctttctccc ttcgggaagc 3360 gtggcgcttt ctcaatgctc acgctgtagg tatctcagttcggtgtaggt cgttcgctcc 3420 aagctgggct gtgtgcacga accccccgtt cagcccgaccgctgcgcctt atccggtaac 3480 tatcgtcttg agtccaaccc ggtaagacac gacttatcgccactggcagc agccactggt 3540 aacaggatta gcagagcgag gtatgtaggc ggtgctacagagttcttgaa gtggtggcct 3600 aactacggct acactagaag gacagtattt ggtatctgcgctctgctgaa gccagttacc 3660 ttcggaaaaa gagttggtag ctcttgatcc ggcaaacaaaccaccgctgg tagcggtggt 3720 ttttttgttt gcaagcagca gattacgcgc agaaaaaaaggatctcaaga agatcctttg 3780 atcttttcta cggggtctga cgctcagtgg aacgaaaactcacgttaagg gattttggtc 3840 atgagattat caaaaaggat cttcacctag atccttttaaattaaaaatg aagttttaaa 3900 tcaatctaaa gtatatatga gtaaacttgg tctgacagttaccaatgctt aatcagtgag 3960 gcacctatct cagcgatctg tctatttcgt tcatccatagttgcctgact ccccgtcgtg 4020 tagataacta cgatacggga gggcttacca tctggccccagtgctgcaat gataccgcga 4080 gacccacgct caccggctcc agatttatca gcaataaaccagccagccgg aagggccgag 4140 cgcagaagtg gtcctgcaac tttatccgcc tccatccagtctattaattg ttgccgggaa 4200 gctagagtaa gtagttcgcc agttaatagt ttgcgcaacgttgttgccat tgctgcaggc 4260 atcgtggtgt cacgctcgtc gtttggtatg gcttcattcagctccggttc ccaacgatca 4320 aggcgagtta catgatcccc catgttgtgc aaaaaagcggttagctcctt cggtcctccg 4380 atcgttgtca gaagtaagtt ggccgcagtg ttatcactcatggttatggc agcactgcat 4440 aattctctta ctgtcatgcc atccgtaaga tgcttttctgtgactggtga gtactcaacc 4500 aagtcattct gagaatagtg tatgcggcga ccgagttgctcttgcccggc gtcaacacgg 4560 gataataccg cgccacatag cagaacttta aaagtgctcatcattggaaa acgttcttcg 4620 gggcgaaaac tctcaaggat cttaccgctg ttgagatccagttcgatgta acccactcgt 4680 gcacccaact gatcttcagc atcttttact ttcaccagcgtttctgggtg agcaaaaaca 4740 
ggaaggcaaa atgccgcaaa aaagggaata agggcgacacggaaatgttg aatactcata 4800 ctcttccttt ttcaatatta ttgaagcatt tatcagggttattgtctcat gagcggatac 4860 atatttgaat gtatttagaa aaataaacaa ataggggttccgcgcacatt tccccgaaaa 4920 gtgccacctg acgtctaaga aaccattatt atcatgacattaacctataa aaataggcgt 4980 atcacgaggc cctttcgtct tcaagaattc cacggactatagactatact agtatactcc 5040 gtctactgta cgatacactt ccgctcaggt ccttgtcctttaacgaggcc ttaccactct 5100 tttgttactc tattgatcca gctcagcaaa ggcagtgtgatctaagattc tatcttcgcg 5160 atgtagtaaa actagctaga ccgagaaaga gactagaaatgcaaaaggca cttctacaat 5220 ggctgccatc attattatcc gatgtgacgc tgcagaagcagaaatacacg cggtcagtga 5280 agctattccg ctattgaata acctcagtca ccttgtgcaagaacttaaca agaaaccaat 5340 tattaaaggc ttacttactg atagtagatc aacgatcagtataattaagt ctacaaatga 5400 agagaaattt agaaacagat tttttggcac aaaggcaatgagacttagag atgaagtatc 5460 aggtaataat ttatacgtat actacatcga gaccaagaagaacattgctg atgtgatgac 5520 aaaacctctt ccgataaaaa catttaaact attaactaacaaatggattc attagatcta 5580 ttacattatg ggtggtatgt tggaataaaa atcaactatcatctactaac tagtatttac 5640 gttactagta tattatcata tacggtgtta gaagatgacgcaaatgatga gaaatagtca 5700 tctaaattag tggaagctga aacgcaagga ttgataatgtaataggatca atgaatatta 5760 acatataaaa tgatgataat aatatttata gaattgtgtagaattgcaga ttccctttta 5820 tggattccta aatcctcgag gagaacttct agtatatctacatacctaat attattgcct 5880 tattaaaaat ggaatcccaa caattacatc aaaatccacattctcttcaa aatcaattgt 5940 cctgtacttc cttgttcatg tgtgttcaaa aacgttatatttataggata attatactct 6000 atttctcaac aagtaattgg ttgtttggcc gagcggtctaaggcgcctga ttcaagaaat 6060 atcttgaccg cagttaactg tgggaatact caggtatcgtaagatgcaag agttcgaatc 6120 tcttagcaac cattattttt ttcctcaaca taacgagaacacacaggggc gctatcgcac 6180 agaatcaaat tcgatgactg gaaatttttt gttaatttcagaggtcgcct gacgcatata 6240 cctttttcaa ctgaaaaatt gggagaaaaa ggaaaggtgagagccgcgga accggctttt 6300 catatagaat agagaagcgt tcatgactaa atgcttgcatcacaatactt gaagttgaca 6360 atattattta aggacctatt gttttttcca ataggtggttagcaatcgtc ttactttcta 6420 acttttctta ccttttacat ttcagcaata 
tatatatatatatttcaagg atataccatt 6480 ctaatgtctg cccctaagaa gatcgtcgtt ttgccaggtgaccacgttgg tcaagaaatc 6540 acagccgaag ccattaaggt tcttaaagct atttctgatgttcgttccaa tgtcaagttc 6600 gatttcgaaa atcatttaat tggtggtgct gctatcgatgctacaggtgt cccacttcca 6660 gatgaggcgc tggaagcctc caagaaggtt gatgccgttttgttaggtgc tgtgggtggt 6720 cctaaatggg gtaccggtag tgttagacct gaacaaggtttactaaaaat ccgtaaagaa 6780 cttcaattgt acgccaactt aagaccatgt aactttgcatccgactctct tttagactta 6840 tctccaatca agccacaatt tgctaaaggt actgacttcgttgttgtcag agaattagtg 6900 ggaggtattt actttggtaa gagaaaggaa gacgatggtgatggtgtcgc ttgggatagt 6960 gaacaataca ccgttccaga agtgcaaaga atcacaagaatggccgcttt catggcccta 7020 caacatgagc caccattgcc tatttggtcc ttggataaagctaatgtttt ggcctcttca 7080 agattatgga gaaaaactgt ggaggaaacc atcaagaacgaattccctac attgaaggtt 7140 caacatcaat tgattgattc tgccgccatg atcctagttaagaacccaac ccacctaaat 7200 ggtattataa tcaccagcaa catgtttggt gatatcatctccgatgaagc ctccgttatc 7260 ccaggttcct tgggtttgtt gccatctgcg tccttggcctctttgccaga caagaacacc 7320 gcatttggtt tgtacgaacc atgccacggt tctgctccagatttgccaaa gaataaggtc 7380 aaccctatcg ccactatctt gtctgctgca atgatgttgaaattgtcatt gaacttgcct 7440 gaagaaggta aggccattga agatgcagtt aaaaaggttttggatgcagg tatcagaact 7500 ggtgatttag gtggttccaa cagtaccacg gaagtcggtgatgctgtcgc cgaagaagtt 7560 aagaaaatcc ttgcttaaaa agattctctt tttttatgatatttgtacat aaactttata 7620 aatgaaattc ataatagaaa cgacacgaaa ttacaaaatggaatatgttc atagggtaga 7680 cgaaactata tacgcaatct acatacattt atcaagaaggagaaaaagga ggatgtaaag 7740 gaatacaggt aagcaaattg atactaatgg ctcaacgtgataaggaaaaa gaattgcact 7800 ttaacattaa tattgacaag gaggagggca ccacacaaaaagttaggtgt aacagaaaat 7860 catgaaacta tgattcctaa tttatatatt ggaggattttctctaaaaaa aaaaaaatac 7920 aacaaataaa aaacactcaa tgacctgacc atttgatggagtttaagtca ataccttctt 7980 gaaccatttc ccataatggt gaaagttccc tcaagaattttactctgtca gaaacggcct 8040 taacgacgta gtcgacctcc tcttcagtac taaatctaccaataccaaat ctgatggaag 8100 aatgggctaa tgcatcatcc ttacccagcg catgtaaaacataagaaggt tctagggaag 8160 
cagatgtaca ggctgaaccc gaggataatg cgatatcccttagtgccatc aataaagatt 8220 ctccttccac gtaggcgaaa gaaacgttaa cacaccctggataacgatga tctggagatc 8280 cgttcaacgt ggtatgttca gcggataata gacctttgactaatttatcg gatagtcttt 8340 tgatgtgagc ttggtcgttg tcaaattctt tcttcatcaatctcgcagct tcaccaaatc 8400 ccgctaccaa tgggggggcc aaagtaccag atctcaatcctctctcttgg ccaccaccgg 8460 atagtaaagg ttctaatcta actcttggtc tccttcttacatagatggca cctattccct 8520 ttggaccgta aatcttgtga gaagaaattg atagtaaatcaatgttcatt tcattgacat 8580 caatgtgaat cttaccatag gcttgtgcgg cgtcagtatgaaagtagatc ttattctttc 8640 tacaaattgc accaatttct ttaataggtt gaatgacaccgatttcatta ttgacagcca 8700 tcacagagac gagacaggta tctggtctaa tggcatcttccaattccttc aaatcgataa 8760 gaccttgatc gtccacattt aggaaagtga cttcaaatccctccttcatc atggcccgtg 8820 cggcttccaa gacacacttg tgttccgttc tagtggtgatgatgtgtttc ttagtcttct 8880 tataaaatct tgggacaccc ttaagaacca tattattagattcggtcgct cccgaagtga 8940 atattatttc cttggggtcg gcattgatca tctttgctacgtaagctcta gcattttcca 9000 cagcagtatt tgtttcccaa ccgtaagagt gagtgttggaatgaggatta ccataaagtc 9060 ccgtataaaa cttcaacatc gtatccaaaa ccctagggtctgttggtgta gtggcttgca 9120 tgtcaagata tatgggacga gtaccaaaac ctgtgttttcttgataagca tggctcattg 9180 cagtgctacc agaagctact acagcatctg gggtggtaccggatgcactc gcacgggcac 9240 tagcctgtgc ctttgcagca gcctgaatat cggtatgcgtttccagagag aagttgtcgt 9300 ctaacttcac gcctgctgca gtctcaatga tattcgaatacgctttgagg agatacagcc 9360 taatatccga caaactgttt tacagattta cgatcgtacttgttacccat cattgaattt 9420 tgaacatccg aacctgggag ttttccctga aacagatagtatatttgaac ctgtataata 9480 atatatagtc tagcgcttta cggaagacaa tgtatgtatttcggttcctg gagaaactat 9540 tgcatctatt gcataggtaa tcttgcacgt cgcatccccggttcattttc tgcgtttcca 9600 tcttgcactt caatagcata tctttgttaa cgaagcatctgtgcttcatt ttgtagaaca 9660 aaaatgcaac gcgagagcgc taatttttca aacaaagaatctgagctgca tttttacaga 9720 acagaaatgc aacgcgaaag cgctatttta ccaacgaagaatctgtgctt catttttgta 9780 aaacaaaaat gcaacgcgag agcgctaatt tttcaaacaaagaatctgag ctgcattttt 9840 acagaacaga aatgcaacgc gagagcgcta 
ttttaccaacaaagaatcta tacttctttt 9900 27 12008 DNA Saccharomyces cerevisiae 27gaattctcat gtttgacagc ttatcatcga taagctttaa tgcggtagtt tatcacagtt 60aaattgctaa cgcagtcagg caccgtgtat gaaatctaac aatgcgctca tcgtcatcct 120cggcaccgtc accctggatg ctgtaggcat aggcttggtt atgccggtac tgccgggcct 180cttgcgggat atcgtccatt ccgacagcat cgccagtcac tatggcgtgc tgctagcgct 240atatgcgttg atgcaatttc tatgcgcacc cgttctcgga gcactgtccg accgctttgg 300ccgccgccca gtcctgctcg cttcgctact tggagccact atcgactacg cgatcatggc 360gaccacaccc gtcctgtgga tcaagcggcc gcagtacgta atgcggtatc gtgaaagcga 420aaaaaaaact aacagtagat aagacagata gacagataga gatggacgag aaacaggggg 480ggagaaaagg ggaaaagaga aggaaagaaa gactcatcta tcgcagataa gacaatcaac 540cctcatggcg cctccaacca ccatccgcac tagggaccaa gcgctcgcac cgttagcaac 600gcttgactca caaaccaact gccggctgaa agagcttgtg caatgggagt gccaattcaa 660aggagccgaa tacgtctgct cgccttttaa gaggcttttt gaacactgca ttgcacccga 720caaatcagcc actaactacg aggtcacgga cacatatacc aatagttaaa aattacatat 780actctatata gcacagtagt gtgataaata aaaaattttg ccaagacttt tttaaactgc 840acccgacaga tcaggtctgt gcctactatg cacttatgcc cggggtcccg ggaggagaaa 900aaacgagggc tgggaaatgt ccgtggactt taaacgctcc gggttagcag agtagcaggg 960ctttcggctt tggaaattta ggtgacttgt tgaaaaagca aaatttgggc tcagtaatgc 1020cactgcagtg gcttatcacg ccaggactgc gggagtggcg ggggcaaaca cacccgcgat 1080aaagagcgcg atgaatataa aagggggcca atgttacgtc ccgttatatt ggagttcttc 1140ccatacaaac ttaagagtcc aattagcttc atcgccaata aaaaaacaag ctaaacctaa 1200ttctaacaag cacatatgcg gtccggatcc agtttaaaca gtagctttgg acttcttcgc 1260cagaggtttg gtcaagtctc caatcaaggt tgtcggcttg tctaccttgc cagaaattta 1320cgaaaagatg gaaaagggtc aaatcgttgg tagatacgtt gttgacactt ctaaataagc 1380gaatttctta tgatttatga tttttattat taaataagtt ataaaaaaaa taagtgtata 1440caaattttaa agtgactctt aggttttaaa acgaaaattc ttgttcttga gtaactcttt 1500cctgtaggtc aggttgcttt ctcaggtata gcatgaggtc gctcttattg accacacctc 1560taccggcatg ccgagcaaat gcctgcaaat cgctccccat ttcacccaat tgtagatatg 1620ctaactccag caatgagttg atgaatctcg gtgtgtattt 
tatgtcctca gaagacaaca 1680cctgttgtaa tcgttcttcc acacggatcg cggccgcttg atcctctacg ccggacgcat 1740cgtggccggc atcaccggcg ccacaggtgc ggttgctggc gcctatatcg ccgacatcac 1800cgatggggaa gatcgggctc gccacttcgg gctcatgagc gcttgtttcg gcgtgggtat 1860ggtggcaggc cccgtggccg ggggactgtt gggcgccatc tccttgcatg caccattcct 1920tgcggcggcg gtgctcaacg gcctcaacct actactgggc tgcttcctaa tgcaggagtc 1980gcataaggga gagcgtcgac cgatgccctt gagagccttc aacccagtca gctccttccg 2040gtgggcgcgg ggcatgacta tcgtcgccgc acttatgact gtcttcttta tcatgcaact 2100cgtaggacag gtgccggcag cgctctgggt cattttcggc gaggaccgct ttcgctggag 2160cgcgacgatg atcggcctgt cgcttgcggt attcggaatc ttgcacgccc tcgctcaagc 2220cttcgtcact ggtcccgcca ccaaacgttt cggcgagaag caggccatta tcgccggcat 2280ggcggccgac gcgctgggct acgtcttgct ggcgttcgcg acgcgaggct ggatggcctt 2340ccccattatg attcttctcg cttccggcgg catcgggatg cccgcgttgc aggccatgct 2400gtccaggcag gtagatgacg accatcaggg acagcttcaa ggatcgctcg cggctcttac 2460cagcctaact tcgatcactg gaccgctgat cgtcacggcg atttatgccg cctcggcgag 2520cacatggaac gggttggcat ggattgtagg cgccgcccta taccttgtct gcctccccgc 2580gttgcgtcgc ggtgcatgga gccgggccac ctcgacctga atggaagccg gcggcacctc 2640gctaacggat tcaccactcc aagaattgga gccaatcaat tcttgcggag aactgtgaat 2700gcgcaaacca acccttggca gaacatatcc atcgcgtccg ccatctccag cagccgcacg 2760cggcgcatct cgggcagcgt tgggtcctgg ccacgggtgc gcatgatcgt gctcctgtcg 2820ttgaggaccc ggctaggctg gcggggttgc cttactggtt agcagaatga atcaccgata 2880cgcgagcgaa cgtgaagcga ctgctgctgc aaaacgtctg cgacctgagc aacaacatga 2940atggtcttcg gtttccgtgt ttcgtaaagt ctggaaacgc ggaagtcagc gccctgcacc 3000attatgttcc ggatctgcat cgcaggatgc tgctggctac cctgtggaac acctacatct 3060gtattaacga agcgctggca ttgaccctga gtgatttttc tctggtcccg ccgcatccat 3120accgccagtt gtttaccctc acaacgttcc agtaaccggg catgttcatc atcagtaacc 3180cgtatcgtga gcatcctctc tcgtttcatc ggtatcatta cccccatgaa cagaaattcc 3240cccttacacg gaggcatcaa gtgaccaaac aggaaaaaac cgcccttaac atggcccgct 3300ttatcagaag ccagacatta acgcttctgg agaaactcaa cgagctggac gcggatgaac 3360aggcagacat 
ctgtgaatcg cttcacgacc acgctgatga gctttaccgc agctgcctcg 3420cgcgtttcgg tgatgacggt gaaaacctct gacacatgca gctcccggag acggtcacag 3480cttgtctgta agcggatgcc gggagcagac aagcccgtca gggcgcgtca gcgggtgttg 3540gcgggtgtcg gggcgcagcc atgacccagt cacgtagcga tagcggagtg tatactggct 3600taactatgcg gcatcagagc agattgtact gagagtgcac gatatccggt gtgaaatacc 3660gcacagatgc gtaaggagaa aataccgcat caggcgctct tccgcttcct cgctcactga 3720ctcgctgcgc tcggtcgttc ggctgcggcg agcggtatca gctcactcaa aggcggtaat 3780acggttatcc acagaatcag gggataacgc aggaaagaac atgtgagcaa aaggccagca 3840aaaggccagg aaccgtaaaa aggccgcgtt gctggcgttt ttccataggc tccgcccccc 3900tgacgagcat cacaaaaatc gacgctcaag tcagaggtgg cgaaacccga caggactata 3960aagataccag gcgtttcccc ctggaagctc cctcgtgcgc tctcctgttc cgaccctgcc 4020gcttaccgga tacctgtccg cctttctccc ttcgggaagc gtggcgcttt ctcaatgctc 4080acgctgtagg tatctcagtt cggtgtaggt cgttcgctcc aagctgggct gtgtgcacga 4140accccccgtt cagcccgacc gctgcgcctt atccggtaac tatcgtcttg agtccaaccc 4200ggtaagacac gacttatcgc cactggcagc agccactggt aacaggatta gcagagcgag 4260gtatgtaggc ggtgctacag agttcttgaa gtggtggcct aactacggct acactagaag 4320gacagtattt ggtatctgcg ctctgctgaa gccagttacc ttcggaaaaa gagttggtag 4380ctcttgatcc ggcaaacaaa ccaccgctgg tagcggtggt ttttttgttt gcaagcagca 4440gattacgcgc agaaaaaaag gatctcaaga agatcctttg atcttttcta cggggtctga 4500cgctcagtgg aacgaaaact cacgttaagg gattttggtc atgagattat caaaaaggat 4560cttcacctag atccttttaa attaaaaatg aagttttaaa tcaatctaaa gtatatatga 4620gtaaacttgg tctgacagtt accaatgctt aatcagtgag gcacctatct cagcgatctg 4680tctatttcgt tcatccatag ttgcctgact ccccgtcgtg tagataacta cgatacggga 4740gggcttacca tctggcccca gtgctgcaat gataccgcga gacccacgct caccggctcc 4800agatttatca gcaataaacc agccagccgg aagggccgag cgcagaagtg gtcctgcaac 4860tttatccgcc tccatccagt ctattaattg ttgccgggaa gctagagtaa gtagttcgcc 4920agttaatagt ttgcgcaacg ttgttgccat tgctgcaggc atcgtggtgt cacgctcgtc 4980gtttggtatg gcttcattca gctccggttc ccaacgatca aggcgagtta catgatcccc 5040catgttgtgc aaaaaagcgg ttagctcctt cggtcctccg 
atcgttgtca gaagtaagtt 5100ggccgcagtg ttatcactca tggttatggc agcactgcat aattctctta ctgtcatgcc 5160atccgtaaga tgcttttctg tgactggtga gtactcaacc aagtcattct gagaatagtg 5220tatgcggcga ccgagttgct cttgcccggc gtcaacacgg gataataccg cgccacatag 5280cagaacttta aaagtgctca tcattggaaa acgttcttcg gggcgaaaac tctcaaggat 5340cttaccgctg ttgagatcca gttcgatgta acccactcgt gcacccaact gatcttcagc 5400atcttttact ttcaccagcg tttctgggtg agcaaaaaca ggaaggcaaa atgccgcaaa 5460aaagggaata agggcgacac ggaaatgttg aatactcata ctcttccttt ttcaatatta 5520ttgaagcatt tatcagggtt attgtctcat gagcggatac atatttgaat gtatttagaa 5580aaataaacaa ataggggttc cgcgcacatt tccccgaaaa gtgccacctg acgtctaaga 5640aaccattatt atcatgacat taacctataa aaataggcgt atcacgaggc cctttcgtct 5700tcaagaattc cacggactat agactatact agtatactcc gtctactgta cgatacactt 5760ccgctcaggt ccttgtcctt taacgaggcc ttaccactct tttgttactc tattgatcca 5820gctcagcaaa ggcagtgtga tctaagattc tatcttcgcg atgtagtaaa actagctaga 5880ccgagaaaga gactagaaat gcaaaaggca cttctacaat ggctgccatc attattatcc 5940gatgtgacgc tgcagaagca gaaatacacg cggtcagtga agctattccg ctattgaata 6000acctcagtca ccttgtgcaa gaacttaaca agaaaccaat tattaaaggc ttacttactg 6060atagtagatc aacgatcagt ataattaagt ctacaaatga agagaaattt agaaacagat 6120tttttggcac aaaggcaatg agacttagag atgaagtatc aggtaataat ttatacgtat 6180actacatcga gaccaagaag aacattgctg atgtgatgac aaaacctctt ccgataaaaa 6240catttaaact attaactaac aaatggattc attagatcta ttacattatg ggtggtatgt 6300tggaataaaa atcaactatc atctactaac tagtatttac gttactagta tattatcata 6360tacggtgtta gaagatgacg caaatgatga gaaatagtca tctaaattag tggaagctga 6420aacgcaagga ttgataatgt aataggatca atgaatatta acatataaaa tgatgataat 6480aatatttata gaattgtgta gaattgcaga ttccctttta tggattccta aatcctcgag 6540gagaacttct agtatatcta catacctaat attattgcct tattaaaaat ggaatcccaa 6600caattacatc aaaatccaca ttctcttcaa aatcaattgt cctgtacttc cttgttcatg 6660tgtgttcaaa aacgttatat ttataggata attatactct atttctcaac aagtaattgg 6720ttgtttggcc gagcggtcta aggcgcctga ttcaagaaat atcttgaccg cagttaactg 6780tgggaatact 
caggtatcgt aagatgcaag agttcgaatc tcttagcaac cattattttt 6840ttcctcaaca taacgagaac acacaggggc gctatcgcac agaatcaaat tcgatgactg 6900gaaatttttt gttaatttca gaggtcgcct gacgcatata cctttttcaa ctgaaaaatt 6960gggagaaaaa ggaaaggtga gagccgcgga accggctttt catatagaat agagaagcgt 7020tcatgactaa atgcttgcat cacaatactt gaagttgaca atattattta aggacctatt 7080gttttttcca ataggtggtt agcaatcgtc ttactttcta acttttctta ccttttacat 7140ttcagcaata tatatatata tatttcaagg atataccatt ctaatgtctg cccctaagaa 7200gatcgtcgtt ttgccaggtg accacgttgg tcaagaaatc acagccgaag ccattaaggt 7260tcttaaagct atttctgatg ttcgttccaa tgtcaagttc gatttcgaaa atcatttaat 7320tggtggtgct gctatcgatg ctacaggtgt cccacttcca gatgaggcgc tggaagcctc 7380caagaaggtt gatgccgttt tgttaggtgc tgtgggtggt cctaaatggg gtaccggtag 7440tgttagacct gaacaaggtt tactaaaaat ccgtaaagaa cttcaattgt acgccaactt 7500aagaccatgt aactttgcat ccgactctct tttagactta tctccaatca agccacaatt 7560tgctaaaggt actgacttcg ttgttgtcag agaattagtg ggaggtattt actttggtaa 7620gagaaaggaa gacgatggtg atggtgtcgc ttgggatagt gaacaataca ccgttccaga 7680agtgcaaaga atcacaagaa tggccgcttt catggcccta caacatgagc caccattgcc 7740tatttggtcc ttggataaag ctaatgtttt ggcctcttca agattatgga gaaaaactgt 7800ggaggaaacc atcaagaacg aattccctac attgaaggtt caacatcaat tgattgattc 7860tgccgccatg atcctagtta agaacccaac ccacctaaat ggtattataa tcaccagcaa 7920catgtttggt gatatcatct ccgatgaagc ctccgttatc ccaggttcct tgggtttgtt 7980gccatctgcg tccttggcct ctttgccaga caagaacacc gcatttggtt tgtacgaacc 8040atgccacggt tctgctccag atttgccaaa gaataaggtc aaccctatcg ccactatctt 8100gtctgctgca atgatgttga aattgtcatt gaacttgcct gaagaaggta aggccattga 8160agatgcagtt aaaaaggttt tggatgcagg tatcagaact ggtgatttag gtggttccaa 8220cagtaccacg gaagtcggtg atgctgtcgc cgaagaagtt aagaaaatcc ttgcttaaaa 8280agattctctt tttttatgat atttgtacat aaactttata aatgaaattc ataatagaaa 8340cgacacgaaa ttacaaaatg gaatatgttc atagggtaga cgaaactata tacgcaatct 8400acatacattt atcaagaagg agaaaaagga ggatgtaaag gaatacaggt aagcaaattg 8460atactaatgg ctcaacgtga taaggaaaaa gaattgcact 
ttaacattaa tattgacaag 8520gaggagggca ccacacaaaa agttaggtgt aacagaaaat catgaaacta tgattcctaa 8580tttatatatt ggaggatttt ctctaaaaaa aaaaaaatac aacaaataaa aaacactcaa 8640tgacctgacc atttgatgga gtttaagtca ataccttctt gaaccatttc ccataatggt 8700gaaagttccc tcaagaattt tactctgtca gaaacggcct taacgacgta gtcgacctcc 8760tcttcagtac taaatctacc aataccaaat ctgatggaag aatgggctaa tgcatcatcc 8820ttacccagcg catgtaaaac ataagaaggt tctagggaag cagatgtaca ggctgaaccc 8880gaggataatg cgatatccct tagtgccatc aataaagatt ctccttccac gtaggcgaaa 8940gaaacgttaa cacaccctgg ataacgatga tctggagatc cgttcaacgt ggtatgttca 9000gcggataata gacctttgac taatttatcg gatagtcttt tgatgtgagc ttggtcgttg 9060tcaaattctt tcttcatcaa tctcgcagct tcaccaaatc ccgctaccaa tgggggggcc 9120aaagtaccag atctcaatcc tctctcttgg ccaccaccgg atagtaaagg ttctaatcta 9180actcttggtc tccttcttac atagatggca cctattccct ttggaccgta aatcttgtga 9240gaagaaattg atagtaaatc aatgttcatt tcattgacat caatgtgaat cttaccatag 9300gcttgtgcgg cgtcagtatg aaagtagatc ttattctttc tacaaattgc accaatttct 9360ttaataggtt gaatgacacc gatttcatta ttgacagcca tcacagagac gagacaggta 9420tctggtctaa tggcatcttc caattccttc aaatcgataa gaccttgatc gtccacattt 9480aggaaagtga cttcaaatcc ctccttcatc atggcccgtg cggcttccaa gacacacttg 9540tgttccgttc tagtggtgat gatgtgtttc ttagtcttct tataaaatct tgggacaccc 9600ttaagaacca tattattaga ttcggtcgct cccgaagtga atattatttc cttggggtcg 9660gcattgatca tctttgctac gtaagctcta gcattttcca cagcagtatt tgtttcccaa 9720ccgtaagagt gagtgttgga atgaggatta ccataaagtc ccgtataaaa cttcaacatc 9780gtatccaaaa ccctagggtc tgttggtgta gtggcttgca tgtcaagata tatgggacga 9840gtaccaaaac ctgtgttttc ttgataagca tggctcattg cagtgctacc agaagctact 9900acagcatctg gggtggtacc ggatgcactc gcacgggcac tagcctgtgc ctttgcagca 9960gcctgaatat cggtatgcgt ttccagagag aagttgtcgt ctaacttcac gcctgctgca 10020gtctcaatga tattcgaata cgctttgagg agatacagcc taatatccga caaactgttt 10080tacagattta cgatcgtact tgttacccat cattgaattt tgaacatccg aacctgggag 10140ttttccctga aacagatagt atatttgaac ctgtataata atatatagtc tagcgcttta 
10200cggaagacaa tgtatgtatt tcggttcctg gagaaactat tgcatctatt gcataggtaa 10260tcttgcacgt cgcatccccg gttcattttc tgcgtttcca tcttgcactt caatagcata 10320tctttgttaa cgaagcatct gtgcttcatt ttgtagaaca aaaatgcaac gcgagagcgc 10380taatttttca aacaaagaat ctgagctgca tttttacaga acagaaatgc aacgcgaaag 10440cgctatttta ccaacgaaga atctgtgctt catttttgta aaacaaaaat gcaacgcgag 10500agcgctaatt tttcaaacaa agaatctgag ctgcattttt acagaacaga aatgcaacgc 10560gagagcgcta ttttaccaac aaagaatcta tacttctttt ttgttctaca aaaatgcatc 10620ccgagagcgc tatttttcta acaaagcatc ttagattact ttttttctcc tttgtgcgct 10680ctataatgca gtctcttgat aactttttgc actgtaggtc cgttaaggtt agaagaaggc 10740tactttggtg tctattttct cttccataaa aaaagcctga ctccacttcc cgcgtttact 10800gattactagc gaagctgcgg gtgcattttt tcaagataaa ggcatccccg attatattct 10860ataccgatgt ggattgcgca tactttgtga acagaaagtg atagcgttga tgattcttca 10920ttggtcagaa aattatgaac ggtttcttct attttgtctc tatatactac gtataggaaa 10980tgtttacatt ttcgtattgt tttcgattca ctctatgaat agttcttact acaatttttt 11040tgtctaaaga gtaatactag agataaacat aaaaaatgta gaggtcgagt ttagatgcaa 11100gttcaaggag cgaaaggtgg atgggtaggt tatataggga tatagcacag agatatatag 11160caaagagata cttttgagca atgtttgtgg aagcggtatt cgcaatattt tagtagctcg 11220ttacagtccg gtgcgttttt ggttttttga aagtgcgtct tcagagcgct tttggttttc 11280aaaagcgctc tgaagttcct atactttcta gagaatagga acttcggaat aggaacttca 11340aagcgtttcc gaaaacgagc gcttccgaaa atgcaacgcg agctgcgcac atacagctca 11400ctgttcacgt cgcacctata tctgcgtgtt gcctgtatat atatatacat gagaagaacg 11460gcatagtgcg tgtttatgct taaatgcgta cttatatgcg tctatttatg taggatgaaa 11520ggtagtctag tacctcctgt gatattatcc cattccatgc ggggtatcgt atgcttcctt 11580cagcactacc ctttagctgt tctatatgct gccactcctc aattggatta gtctcatcct 11640tcaatgctat catttccttt gatattcgat cctaggcata gtaccgagaa actagtgcga 11700agtagtgatc aggtattgct gttatctgat gagtatacgt tgtcctggcc acggcagaag 11760cacgcttatc gctccaattt cccacaacat tagtcaactc cgttaggccc ttcattgaaa 11820gaaatgaggt catcaaatgt cttccaatgt gagattttgg gccatttttt atagcaaaga 
11880ttgaataagg cgcatttttc ttcaaagctt tattgtacga tctgactaag ttatctttta 11940ataattggta ttcctgttta ttgcttgaag aattgccggt cctatttact cgttttagga 12000ctggttca 12008 28 13654 DNA Saccharomyces cerevisiae 28 gaattctcatgtttgacagc ttatcatcga taagctttaa tgcggtagtt tatcacagtt 60 aaattgctaacgcagtcagg caccgtgtat gaaatctaac aatgcgctca tcgtcatcct 120 cggcaccgtcaccctggatg ctgtaggcat aggcttggtt atgccggtac tgccgggcct 180 cttgcgggatatcgtccatt ccgacagcat cgccagtcac tatggcgtgc tgctagcgct 240 atatgcgttgatgcaatttc tatgcgcacc cgttctcgga gcactgtccg accgctttgg 300 ccgccgcccagtcctgctcg cttcgctact tggagccact atcgactacg cgatcatggc 360 gaccacacccgtcctgtgga tcaagcggcc gcagtacgta atgcggtatc gtgaaagcga 420 aaaaaaaactaacagtagat aagacagata gacagataga gatggacgag aaacaggggg 480 ggagaaaaggggaaaagaga aggaaagaaa gactcatcta tcgcagataa gacaatcaac 540 cctcatggcgcctccaacca ccatccgcac tagggaccaa gcgctcgcac cgttagcaac 600 gcttgactcacaaaccaact gccggctgaa agagcttgtg caatgggagt gccaattcaa 660 aggagccgaatacgtctgct cgccttttaa gaggcttttt gaacactgca ttgcacccga 720 caaatcagccactaactacg aggtcacgga cacatatacc aatagttaaa aattacatat 780 actctatatagcacagtagt gtgataaata aaaaattttg ccaagacttt tttaaactgc 840 acccgacagatcaggtctgt gcctactatg cacttatgcc cggggtcccg ggaggagaaa 900 aaacgagggctgggaaatgt ccgtggactt taaacgctcc gggttagcag agtagcaggg 960 ctttcggctttggaaattta ggtgacttgt tgaaaaagca aaatttgggc tcagtaatgc 1020 cactgcagtggcttatcacg ccaggactgc gggagtggcg ggggcaaaca cacccgcgat 1080 aaagagcgcgatgaatataa aagggggcca atgttacgtc ccgttatatt ggagttcttc 1140 ccatacaaacttaagagtcc aattagcttc atcgccaata aaaaaacaag ctaaacctaa 1200 ttctaacaagcacatatgga agacgccaaa aacataaaga aaggcccggc gccattctat 1260 ccgctggaagatggaaccgc tggagagcaa ctgcataagg ctatgaagag atacgccctg 1320 gttcctggaacaattgcttt tacagatgca catatcgagg tggacatcac ttacgctgag 1380 tacttcgaaatgtccgttcg gttggcagaa gctatgaaac gatatgggct gaatacaaat 1440 cacagaatcgtcgtatgcag tgaaaactct cttcaattct ttatgccggt gttgggcgcg 1500 ttatttatcggagttgcagt tgcgcccgcg aacgacattt ataatgaacg 
tgaattgctc 1560 aacagtatgggcatttcgca gcctaccgtg gtgttcgttt ccaaaaaggg gttgcaaaaa 1620 attttgaacgtgcaaaaaaa gctcccaatc atccaaaaaa ttattatcat ggattctaaa 1680 acggattaccagggatttca gtcgatgtac acgttcgtca catctcatct acctcccggt 1740 tttaatgaatacgattttgt gccagagtcc ttcgataggg acaagacaat tgcactgatc 1800 atgaactcctctggatctac tggtctgcct aaaggtgtcg ctctgcctca tagaactgcc 1860 tgcgtgagattctcgcatgc cagagatcct atttttggca atcaaatcat tccggatact 1920 gcgattttaagtgttgttcc attccatcac ggttttggaa tgtttactac actcggatat 1980 ttgatatgtggatttcgagt cgtcttaatg tatagatttg aagaagagct gtttctgagg 2040 agccttcaggattacaagat tcaaagtgcg ctgctggtgc caaccctatt ctccttcttc 2100 gccaaaagcactctgattga caaatacgat ttatctaatt tacacgaaat tgcttctggt 2160 ggcgctcccctctctaagga agtcggggaa gcggttgcca agaggttcca tctgccaggt 2220 atcaggcaaggatatgggct cactgagact acatcagcta ttctgattac acccgagggg 2280 gatgataaaccgggcgcggt cggtaaagtt gttccatttt ttgaagcgaa ggttgtggat 2340 ctggataccgggaaaacgct gggcgttaat caaagaggcg aactgtgtgt gagaggtcct 2400 atgattatgtccggttatgt aaacaatccg gaagcgacca acgccttgat tgacaaggat 2460 ggatggctacattctggaga catagcttac tgggacgaag acgaacactt cttcatcgtt 2520 gaccgcctgaagtctctgat taagtacaaa ggctatcagg tggctcccgc tgaattggaa 2580 tccatcttgctccaacaccc caacatcttc gacgcaggtg tcgcaggtct tcccgacgat 2640 gacgccggtgaacttcccgc cgccgttgtt gttttggagc acggaaagac gatgacggaa 2700 aaagagatcgtggattacgt cgccagtcaa gtaacaaccg cgaaaaagtt gcgcggagga 2760 gttgtgtttgtggacgaagt accgaaaggt cttaccggaa aactcgacgc aagaaaaatc 2820 agagagatcctcataaaggc caagaagggc ggaaagatcg ccgtgtaatt ggatccagtt 2880 taaacagtagctttggactt cttcgccaga ggtttggtca agtctccaat caaggttgtc 2940 ggcttgtctaccttgccaga aatttacgaa aagatggaaa agggtcaaat cgttggtaga 3000 tacgttgttgacacttctaa ataagcgaat ttcttatgat ttatgatttt tattattaaa 3060 taagttataaaaaaaataag tgtatacaaa ttttaaagtg actcttaggt tttaaaacga 3120 aaattcttgttcttgagtaa ctctttcctg taggtcaggt tgctttctca ggtatagcat 3180 gaggtcgctcttattgacca cacctctacc ggcatgccga gcaaatgcct gcaaatcgct 3240 ccccatttcacccaattgta 
gatatgctaa ctccagcaat gagttgatga atctcggtgt 3300 gtattttatgtcctcagaag acaacacctg ttgtaatcgt tcttccacac ggatcgcggc 3360 cgcttgatcctctacgccgg acgcatcgtg gccggcatca ccggcgccac aggtgcggtt 3420 gctggcgcctatatcgccga catcaccgat ggggaagatc gggctcgcca cttcgggctc 3480 atgagcgcttgtttcggcgt gggtatggtg gcaggccccg tggccggggg actgttgggc 3540 gccatctccttgcatgcacc attccttgcg gcggcggtgc tcaacggcct caacctacta 3600 ctgggctgcttcctaatgca ggagtcgcat aagggagagc gtcgaccgat gcccttgaga 3660 gccttcaacccagtcagctc cttccggtgg gcgcggggca tgactatcgt cgccgcactt 3720 atgactgtcttctttatcat gcaactcgta ggacaggtgc cggcagcgct ctgggtcatt 3780 ttcggcgaggaccgctttcg ctggagcgcg acgatgatcg gcctgtcgct tgcggtattc 3840 ggaatcttgcacgccctcgc tcaagccttc gtcactggtc ccgccaccaa acgtttcggc 3900 gagaagcaggccattatcgc cggcatggcg gccgacgcgc tgggctacgt cttgctggcg 3960 ttcgcgacgcgaggctggat ggccttcccc attatgattc ttctcgcttc cggcggcatc 4020 gggatgcccgcgttgcaggc catgctgtcc aggcaggtag atgacgacca tcagggacag 4080 cttcaaggatcgctcgcggc tcttaccagc ctaacttcga tcactggacc gctgatcgtc 4140 acggcgatttatgccgcctc ggcgagcaca tggaacgggt tggcatggat tgtaggcgcc 4200 gccctataccttgtctgcct ccccgcgttg cgtcgcggtg catggagccg ggccacctcg 4260 acctgaatggaagccggcgg cacctcgcta acggattcac cactccaaga attggagcca 4320 atcaattcttgcggagaact gtgaatgcgc aaaccaaccc ttggcagaac atatccatcg 4380 cgtccgccatctccagcagc cgcacgcggc gcatctcggg cagcgttggg tcctggccac 4440 gggtgcgcatgatcgtgctc ctgtcgttga ggacccggct aggctggcgg ggttgcctta 4500 ctggttagcagaatgaatca ccgatacgcg agcgaacgtg aagcgactgc tgctgcaaaa 4560 cgtctgcgacctgagcaaca acatgaatgg tcttcggttt ccgtgtttcg taaagtctgg 4620 aaacgcggaagtcagcgccc tgcaccatta tgttccggat ctgcatcgca ggatgctgct 4680 ggctaccctgtggaacacct acatctgtat taacgaagcg ctggcattga ccctgagtga 4740 tttttctctggtcccgccgc atccataccg ccagttgttt accctcacaa cgttccagta 4800 accgggcatgttcatcatca gtaacccgta tcgtgagcat cctctctcgt ttcatcggta 4860 tcattacccccatgaacaga aattccccct tacacggagg catcaagtga ccaaacagga 4920 aaaaaccgcccttaacatgg cccgctttat cagaagccag acattaacgc 
ttctggagaa 4980 actcaacgagctggacgcgg atgaacaggc agacatctgt gaatcgcttc acgaccacgc 5040 tgatgagctttaccgcagct gcctcgcgcg tttcggtgat gacggtgaaa acctctgaca 5100 catgcagctcccggagacgg tcacagcttg tctgtaagcg gatgccggga gcagacaagc 5160 ccgtcagggcgcgtcagcgg gtgttggcgg gtgtcggggc gcagccatga cccagtcacg 5220 tagcgatagcggagtgtata ctggcttaac tatgcggcat cagagcagat tgtactgaga 5280 gtgcacgatatccggtgtga aataccgcac agatgcgtaa ggagaaaata ccgcatcagg 5340 cgctcttccgcttcctcgct cactgactcg ctgcgctcgg tcgttcggct gcggcgagcg 5400 gtatcagctcactcaaaggc ggtaatacgg ttatccacag aatcagggga taacgcagga 5460 aagaacatgtgagcaaaagg ccagcaaaag gccaggaacc gtaaaaaggc cgcgttgctg 5520 gcgtttttccataggctccg cccccctgac gagcatcaca aaaatcgacg ctcaagtcag 5580 aggtggcgaaacccgacagg actataaaga taccaggcgt ttccccctgg aagctccctc 5640 gtgcgctctcctgttccgac cctgccgctt accggatacc tgtccgcctt tctcccttcg 5700 ggaagcgtggcgctttctca atgctcacgc tgtaggtatc tcagttcggt gtaggtcgtt 5760 cgctccaagctgggctgtgt gcacgaaccc cccgttcagc ccgaccgctg cgccttatcc 5820 ggtaactatcgtcttgagtc caacccggta agacacgact tatcgccact ggcagcagcc 5880 actggtaacaggattagcag agcgaggtat gtaggcggtg ctacagagtt cttgaagtgg 5940 tggcctaactacggctacac tagaaggaca gtatttggta tctgcgctct gctgaagcca 6000 gttaccttcggaaaaagagt tggtagctct tgatccggca aacaaaccac cgctggtagc 6060 ggtggtttttttgtttgcaa gcagcagatt acgcgcagaa aaaaaggatc tcaagaagat 6120 cctttgatcttttctacggg gtctgacgct cagtggaacg aaaactcacg ttaagggatt 6180 ttggtcatgagattatcaaa aaggatcttc acctagatcc ttttaaatta aaaatgaagt 6240 tttaaatcaatctaaagtat atatgagtaa acttggtctg acagttacca atgcttaatc 6300 agtgaggcacctatctcagc gatctgtcta tttcgttcat ccatagttgc ctgactcccc 6360 gtcgtgtagataactacgat acgggagggc ttaccatctg gccccagtgc tgcaatgata 6420 ccgcgagacccacgctcacc ggctccagat ttatcagcaa taaaccagcc agccggaagg 6480 gccgagcgcagaagtggtcc tgcaacttta tccgcctcca tccagtctat taattgttgc 6540 cgggaagctagagtaagtag ttcgccagtt aatagtttgc gcaacgttgt tgccattgct 6600 gcaggcatcgtggtgtcacg ctcgtcgttt ggtatggctt cattcagctc cggttcccaa 6660 cgatcaaggcgagttacatg 
atcccccatg ttgtgcaaaa aagcggttag ctccttcggt 6720 cctccgatcgttgtcagaag taagttggcc gcagtgttat cactcatggt tatggcagca 6780 ctgcataattctcttactgt catgccatcc gtaagatgct tttctgtgac tggtgagtac 6840 tcaaccaagtcattctgaga atagtgtatg cggcgaccga gttgctcttg cccggcgtca 6900 acacgggataataccgcgcc acatagcaga actttaaaag tgctcatcat tggaaaacgt 6960 tcttcggggcgaaaactctc aaggatctta ccgctgttga gatccagttc gatgtaaccc 7020 actcgtgcacccaactgatc ttcagcatct tttactttca ccagcgtttc tgggtgagca 7080 aaaacaggaaggcaaaatgc cgcaaaaaag ggaataaggg cgacacggaa atgttgaata 7140 ctcatactcttcctttttca atattattga agcatttatc agggttattg tctcatgagc 7200 ggatacatatttgaatgtat ttagaaaaat aaacaaatag gggttccgcg cacatttccc 7260 cgaaaagtgccacctgacgt ctaagaaacc attattatca tgacattaac ctataaaaat 7320 aggcgtatcacgaggccctt tcgtcttcaa gaattccacg gactatagac tatactagta 7380 tactccgtctactgtacgat acacttccgc tcaggtcctt gtcctttaac gaggccttac 7440 cactcttttgttactctatt gatccagctc agcaaaggca gtgtgatcta agattctatc 7500 ttcgcgatgtagtaaaacta gctagaccga gaaagagact agaaatgcaa aaggcacttc 7560 tacaatggctgccatcatta ttatccgatg tgacgctgca gaagcagaaa tacacgcggt 7620 cagtgaagctattccgctat tgaataacct cagtcacctt gtgcaagaac ttaacaagaa 7680 accaattattaaaggcttac ttactgatag tagatcaacg atcagtataa ttaagtctac 7740 aaatgaagagaaatttagaa acagattttt tggcacaaag gcaatgagac ttagagatga 7800 agtatcaggtaataatttat acgtatacta catcgagacc aagaagaaca ttgctgatgt 7860 gatgacaaaacctcttccga taaaaacatt taaactatta actaacaaat ggattcatta 7920 gatctattacattatgggtg gtatgttgga ataaaaatca actatcatct actaactagt 7980 atttacgttactagtatatt atcatatacg gtgttagaag atgacgcaaa tgatgagaaa 8040 tagtcatctaaattagtgga agctgaaacg caaggattga taatgtaata ggatcaatga 8100 atattaacatataaaatgat gataataata tttatagaat tgtgtagaat tgcagattcc 8160 cttttatggattcctaaatc ctcgaggaga acttctagta tatctacata cctaatatta 8220 ttgccttattaaaaatggaa tcccaacaat tacatcaaaa tccacattct cttcaaaatc 8280 aattgtcctgtacttccttg ttcatgtgtg ttcaaaaacg ttatatttat aggataatta 8340 tactctatttctcaacaagt aattggttgt ttggccgagc ggtctaaggc 
gcctgattca 8400 agaaatatcttgaccgcagt taactgtggg aatactcagg tatcgtaaga tgcaagagtt 8460 cgaatctcttagcaaccatt atttttttcc tcaacataac gagaacacac aggggcgcta 8520 tcgcacagaatcaaattcga tgactggaaa ttttttgtta atttcagagg tcgcctgacg 8580 catatacctttttcaactga aaaattggga gaaaaaggaa aggtgagagc cgcggaaccg 8640 gcttttcatatagaatagag aagcgttcat gactaaatgc ttgcatcaca atacttgaag 8700 ttgacaatattatttaagga cctattgttt tttccaatag gtggttagca atcgtcttac 8760 tttctaacttttcttacctt ttacatttca gcaatatata tatatatatt tcaaggatat 8820 accattctaatgtctgcccc taagaagatc gtcgttttgc caggtgacca cgttggtcaa 8880 gaaatcacagccgaagccat taaggttctt aaagctattt ctgatgttcg ttccaatgtc 8940 aagttcgatttcgaaaatca tttaattggt ggtgctgcta tcgatgctac aggtgtccca 9000 cttccagatgaggcgctgga agcctccaag aaggttgatg ccgttttgtt aggtgctgtg 9060 ggtggtcctaaatggggtac cggtagtgtt agacctgaac aaggtttact aaaaatccgt 9120 aaagaacttcaattgtacgc caacttaaga ccatgtaact ttgcatccga ctctctttta 9180 gacttatctccaatcaagcc acaatttgct aaaggtactg acttcgttgt tgtcagagaa 9240 ttagtgggaggtatttactt tggtaagaga aaggaagacg atggtgatgg tgtcgcttgg 9300 gatagtgaacaatacaccgt tccagaagtg caaagaatca caagaatggc cgctttcatg 9360 gccctacaacatgagccacc attgcctatt tggtccttgg ataaagctaa tgttttggcc 9420 tcttcaagattatggagaaa aactgtggag gaaaccatca agaacgaatt ccctacattg 9480 aaggttcaacatcaattgat tgattctgcc gccatgatcc tagttaagaa cccaacccac 9540 ctaaatggtattataatcac cagcaacatg tttggtgata tcatctccga tgaagcctcc 9600 gttatcccaggttccttggg tttgttgcca tctgcgtcct tggcctcttt gccagacaag 9660 aacaccgcatttggtttgta cgaaccatgc cacggttctg ctccagattt gccaaagaat 9720 aaggtcaaccctatcgccac tatcttgtct gctgcaatga tgttgaaatt gtcattgaac 9780 ttgcctgaagaaggtaaggc cattgaagat gcagttaaaa aggttttgga tgcaggtatc 9840 agaactggtgatttaggtgg ttccaacagt accacggaag tcggtgatgc tgtcgccgaa 9900 gaagttaagaaaatccttgc ttaaaaagat tctctttttt tatgatattt gtacataaac 9960 tttataaatgaaattcataa tagaaacgac acgaaattac aaaatggaat atgttcatag 10020 ggtagacgaaactatatacg caatctacat acatttatca agaaggagaa aaaggaggat 10080 gtaaaggaatacaggtaagc 
aaattgatac taatggctca acgtgataag gaaaaagaat 10140 tgcactttaacattaatatt gacaaggagg agggcaccac acaaaaagtt aggtgtaaca 10200 gaaaatcatgaaactatgat tcctaattta tatattggag gattttctct aaaaaaaaaa 10260 aaatacaacaaataaaaaac actcaatgac ctgaccattt gatggagttt aagtcaatac 10320 cttcttgaaccatttcccat aatggtgaaa gttccctcaa gaattttact ctgtcagaaa 10380 cggccttaacgacgtagtcg acctcctctt cagtactaaa tctaccaata ccaaatctga 10440 tggaagaatgggctaatgca tcatccttac ccagcgcatg taaaacataa gaaggttcta 10500 gggaagcagatgtacaggct gaacccgagg ataatgcgat atcccttagt gccatcaata 10560 aagattctccttccacgtag gcgaaagaaa cgttaacaca ccctggataa cgatgatctg 10620 gagatccgttcaacgtggta tgttcagcgg ataatagacc tttgactaat ttatcggata 10680 gtcttttgatgtgagcttgg tcgttgtcaa attctttctt catcaatctc gcagcttcac 10740 caaatcccgctaccaatggg ggggccaaag taccagatct caatcctctc tcttggccac 10800 caccggatagtaaaggttct aatctaactc ttggtctcct tcttacatag atggcaccta 10860 ttccctttggaccgtaaatc ttgtgagaag aaattgatag taaatcaatg ttcatttcat 10920 tgacatcaatgtgaatctta ccataggctt gtgcggcgtc agtatgaaag tagatcttat 10980 tctttctacaaattgcacca atttctttaa taggttgaat gacaccgatt tcattattga 11040 cagccatcacagagacgaga caggtatctg gtctaatggc atcttccaat tccttcaaat 11100 cgataagaccttgatcgtcc acatttagga aagtgacttc aaatccctcc ttcatcatgg 11160 cccgtgcggcttccaagaca cacttgtgtt ccgttctagt ggtgatgatg tgtttcttag 11220 tcttcttataaaatcttggg acacccttaa gaaccatatt attagattcg gtcgctcccg 11280 aagtgaatattatttccttg gggtcggcat tgatcatctt tgctacgtaa gctctagcat 11340 tttccacagcagtatttgtt tcccaaccgt aagagtgagt gttggaatga ggattaccat 11400 aaagtcccgtataaaacttc aacatcgtat ccaaaaccct agggtctgtt ggtgtagtgg 11460 cttgcatgtcaagatatatg ggacgagtac caaaacctgt gttttcttga taagcatggc 11520 tcattgcagtgctaccagaa gctactacag catctggggt ggtaccggat gcactcgcac 11580 gggcactagcctgtgccttt gcagcagcct gaatatcggt atgcgtttcc agagagaagt 11640 tgtcgtctaacttcacgcct gctgcagtct caatgatatt cgaatacgct ttgaggagat 11700 acagcctaatatccgacaaa ctgttttaca gatttacgat cgtacttgtt acccatcatt 11760 gaattttgaacatccgaacc tgggagtttt 
ccctgaaaca gatagtatat ttgaacctgt 11820 ataataatatatagtctagc gctttacgga agacaatgta tgtatttcgg ttcctggaga 11880 aactattgcatctattgcat aggtaatctt gcacgtcgca tccccggttc attttctgcg 11940 tttccatcttgcacttcaat agcatatctt tgttaacgaa gcatctgtgc ttcattttgt 12000 agaacaaaaatgcaacgcga gagcgctaat ttttcaaaca aagaatctga gctgcatttt 12060 tacagaacagaaatgcaacg cgaaagcgct attttaccaa cgaagaatct gtgcttcatt 12120 tttgtaaaacaaaaatgcaa cgcgagagcg ctaatttttc aaacaaagaa tctgagctgc 12180 atttttacagaacagaaatg caacgcgaga gcgctatttt accaacaaag aatctatact 12240 tcttttttgttctacaaaaa tgcatcccga gagcgctatt tttctaacaa agcatcttag 12300 attactttttttctcctttg tgcgctctat aatgcagtct cttgataact ttttgcactg 12360 taggtccgttaaggttagaa gaaggctact ttggtgtcta ttttctcttc cataaaaaaa 12420 gcctgactccacttcccgcg tttactgatt actagcgaag ctgcgggtgc attttttcaa 12480 gataaaggcatccccgatta tattctatac cgatgtggat tgcgcatact ttgtgaacag 12540 aaagtgatagcgttgatgat tcttcattgg tcagaaaatt atgaacggtt tcttctattt 12600 tgtctctatatactacgtat aggaaatgtt tacattttcg tattgttttc gattcactct 12660 atgaatagttcttactacaa tttttttgtc taaagagtaa tactagagat aaacataaaa 12720 aatgtagaggtcgagtttag atgcaagttc aaggagcgaa aggtggatgg gtaggttata 12780 tagggatatagcacagagat atatagcaaa gagatacttt tgagcaatgt ttgtggaagc 12840 ggtattcgcaatattttagt agctcgttac agtccggtgc gtttttggtt ttttgaaagt 12900 gcgtcttcagagcgcttttg gttttcaaaa gcgctctgaa gttcctatac tttctagaga 12960 ataggaacttcggaatagga acttcaaagc gtttccgaaa acgagcgctt ccgaaaatgc 13020 aacgcgagctgcgcacatac agctcactgt tcacgtcgca cctatatctg cgtgttgcct 13080 gtatatatatatacatgaga agaacggcat agtgcgtgtt tatgcttaaa tgcgtactta 13140 tatgcgtctatttatgtagg atgaaaggta gtctagtacc tcctgtgata ttatcccatt 13200 ccatgcggggtatcgtatgc ttccttcagc actacccttt agctgttcta tatgctgcca 13260 ctcctcaattggattagtct catccttcaa tgctatcatt tcctttgata ttcgatccta 13320 ggcatagtaccgagaaacta gtgcgaagta gtgatcaggt attgctgtta tctgatgagt 13380 atacgttgtcctggccacgg cagaagcacg cttatcgctc caatttccca caacattagt 13440 caactccgttaggcccttca ttgaaagaaa tgaggtcatc 
aaatgtcttc caatgtgaga 13500 ttttgggccattttttatag caaagattga ataaggcgca tttttcttca aagctttatt 13560 gtacgatctgactaagttat cttttaataa ttggtattcc tgtttattgc ttgaagaatt 13620 gccggtcctatttactcgtt ttaggactgg ttca 13654 29 600 DNA Saccharomyces cerevisiae 29agaaccaaat gggaaaatcg gaatgggtcc agaactgctt tgagtgctgg ctattggcgt 60ctgatttccg ttttgggaat cctttgccgc gcgcccctct caaaactccg cacaagtccc 120agaaagcggg aaagaaataa aacgccacca aaaaaaaaaa aataaaagcc aatcctcgaa 180gcgtgggtgg taggccctgg attatcccgt acaagtattt ctcaggagta aaaaaaccgt 240ttgttttgga attccccatt tcgcggccac ctacgccgct atctttgcaa caactatctg 300cgataactca gcaaattttg catattcgtg ttgcagtatt gcgataatgg gagtcttact 360tccaacataa cggcagaaag aaatgtgaga aaattttgca tcctttgcct ccgttcaagt 420atataaagtc ggcatgcttg ataatctttc tttccatcct acattgttct aattattctt 480attctccttt attctttcct aacataccaa gaaattaatc ttctgtcatt cgcttaaaca 540ctatatcaat aatgcaattt tctactgtcg cttctatcgc cgctgtcgcc gctgtcgctt 600 30850 DNA Saccharomyces cerevisiae 30 gccacgggtc aacccgattg ggatcaccccactggggccc aagcctgata tccgacctcc 60 atgaaatttt tttttttctt tcgattagcacgcacacaca tcacatagac tgcgtcataa 120 aaatacacta cggaaaaacc ataaagagcaaagcgatacc tacttggaag gaaaaggagc 180 acgcttgtaa gggggatggg ggctaagaagtcattcactt tcttttccct tcgcggtccg 240 gacccgggac ccctcctctc cccgcacgatttcttccttt catatcttcc ttttattcct 300 atcccgttga agcaaccgca ctatgactaaatggtgctgg acatctccat ggctgtgact 360 tgtgtgtatc tcacagtggt aacggcaccgtggctcggaa acggttcctt cgtgacaatt 420 ctagaacagg ggctacagtc tcgataatagaataataagc gcatttttgc tagcgccgcc 480 gcggcgcccg tttcccaata gggaggcgcagtttatcggc ggagctctac ttcttcctat 540 ttgggtaagc ccctttctgt tttcggccagtggttgctgc aggctgcgcc ggagaacata 600 gtgataaggg atgtaacttt cgatgagagaattagcaagc ggaaaaaaac tatggctagc 660 tgggagttgt ttttcaatca tataaaagggagaaattgtt gctcactatg tgacagtttc 720 tgggacgtct taacttttat tgcagaggactatcaaatca tacagatatt gtcaaaaaaa 780 aaaaagacta ataataaaaa atgaagttatctcaagttgt tgtttccgcc gtcgccttca 840 ctggtttagt 850 31 600 DNASaccharomyces cerevisiae 31 
aaagaatcca tcactatttg aaaaaaagtc atctggcacgtttaattatc agagcagaaa 60 tgatgaaggg tgttagcgcc gtccactgat gtgcctggtagtcatgattt acgtataact 120 aacacatcat gaggacggcg gcgtcacccc aacgcaaaagagtgacttcc ctgcgctttg 180 ccaaaacccc atacatcgcc atctggctcc tggcagggcggttgatggac atcagccgcc 240 tcccttaatt gctaaagcct ccacaaggca caattaagcaatatttcggg aaagtacacc 300 agtcagtttg cgcttttatg actgggttct aaggtactagatgtgaagta gtggtgacag 360 aatcagggag ataagaggga gcagggtggg gtaatgatgtgcgataacaa tcttgcttgg 420 ctaatcaccc ccatatcttg tagtgagtat ataaataggagcctcccttc ctattgcaac 480 tccataaaat ttttttttgt agccacttct gtaacaagataaataaaacc aactaatcga 540 gatatcaaat atgggtagtt tttgggacgc attcgcagtatacgacaaga aaaagcacgc 600 32 600 DNA Saccharomyces cerevisiae 32ttcaggagtc tctcgcgtta gagcagtacg tggcgcagct aaactcgccg ggaggtctgc 60ttcacgagcg cggtgtgcgc ctagtattgc cccgacggtc cgggtgccta tccctagatt 120tcgtcgtgcc ccgacccaaa tagttaaacg tgtggtttat gggtgcacca gggctttatc 180gtgttttata tcgatggcga tttgtgcctc cagtgtattt ttgtatatcc aattaaggtt 240tcttacctaa ttttattttt atcatcttta gttaatgctg gtttgctctg tttctgctgc 300tttctgtgcg gttctcctct tctcttgttt cttcgtgttg tcccccatcg ccgatgggct 360tatatggcgt atatatatag agcgagtttt tacgtcgaag atcatctcag tttgcttgat 420agcctttcta ctttattact ttcgttttta acctcattat actttagttt tctttgatcg 480gtttttttct ctgtatactt aaaagttcaa atcaaagaaa catacaaaac tacgtttata 540tcaattaata atgtctgaaa ttcaaaacaa agctgaaact gccgcccaag atgtccaaca 600
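
The sequence listing entries above interleave blocks of ten nucleotides with cumulative position counters (60, 120, and so on). As an illustration only, the following minimal Python sketch shows one way such a fragment could be reduced to a bare nucleotide string and its length checked against the counter. The function name `clean_sequence` and the variable `fragment` are hypothetical and are not part of the disclosure; the fragment is the first counted row of SEQ ID NO: 32 as printed above.

```python
import re


def clean_sequence(raw: str) -> str:
    """Strip position counters and whitespace from a sequence-listing fragment.

    Hypothetical helper: assumes the fragment contains only nucleotide
    letters, digits used as position counters, and whitespace.
    """
    # Remove every run of digits and whitespace, then normalise case.
    seq = re.sub(r"[\d\s]+", "", raw).lower()
    # Reject anything that is not an unambiguous DNA base.
    unexpected = set(seq) - set("acgt")
    if unexpected:
        raise ValueError(f"unexpected characters: {sorted(unexpected)}")
    return seq


# First counted row of SEQ ID NO: 32, copied from the listing above.
fragment = (
    "ttcaggagtc tctcgcgtta gagcagtacg "
    "tggcgcagct aaactcgccg ggaggtctgc 60"
)

cleaned = clean_sequence(fragment)
print(len(cleaned))
```

The printed length can be compared with the trailing counter on the row; a mismatch would suggest that the row was damaged during extraction.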

What is claimed is:
 1. An isolated and purified polynucleotide consisting of SEQ ID NO: 4, wherein the polynucleotide is operative as a promoter to express a nucleic acid molecule encoding a polypeptide when operably linked to said nucleic acid molecule.
 2. A yeast expression vector comprising the polynucleotide of claim 1.
 3. A yeast cell transformed with the yeast expression vector of claim 1.
 4. A method for producing a polypeptide comprising the steps of: (a) constructing a yeast expression vector wherein a nucleic acid encoding the polypeptide is controlled by the polynucleotide of claim 1; (b) transforming a culture of yeast cells with the yeast expression vector; (c) maintaining the yeast cells in culture so that the polypeptide is expressed; and (d) recovering the polypeptide.
 5. A method for producing a polypeptide comprising the steps of: (a) cloning a nucleic acid molecule encoding the polypeptide into an expression vector selected from the group consisting of pZEO1P+luc and pZEO1P, wherein the nucleic acid molecule is operably linked to a promoter of the expression vector; (b) transforming a culture of yeast cells with the yeast expression vector; (c) maintaining the yeast cells in culture so that the polypeptide is expressed; and (d) recovering the polypeptide.
 6. A method for producing a polypeptide comprising the steps of: (a) constructing a yeast expression vector wherein a nucleic acid molecule encoding the polypeptide is controlled by the polynucleotide of claim 1; (b) transforming a culture of yeast cells with the yeast expression vector; (c) maintaining the yeast cells in culture medium and controlling the expression of the nucleic acid molecule encoding the polypeptide by varying the level of a fermentable carbon source in the culture medium; and (d) recovering the polypeptide.
 7. The method of claim 6 wherein the fermentable carbon source is glucose.
 8. A method for producing a polypeptide comprising the steps of: (a) constructing a yeast expression vector wherein a nucleic acid molecule encoding the polypeptide is controlled by the polynucleotide of claim 1; (b) transforming a culture of yeast cells with the yeast expression vector; (c) maintaining the yeast cells in culture medium and controlling the expression of the nucleic acid molecule encoding the polypeptide by varying the level of a non-fermentable carbon source in the culture medium; and (d) recovering the polypeptide.
 9. The method of claim 8 wherein the non-fermentable carbon source is ethanol.
 10. A method for producing a polypeptide comprising the steps of: (a) constructing a yeast expression vector wherein a nucleic acid molecule encoding the polypeptide is controlled by the polynucleotide of claim 1; (b) transforming a culture of yeast cells with the yeast expression vector; (c) maintaining the yeast cells in culture medium and controlling the expression of the nucleic acid molecule encoding the polypeptide by varying the level of a fermentable carbon source and a non-fermentable carbon source in the culture medium; and (d) recovering the polypeptide.
 11. The method of claim 10 wherein the fermentable carbon source is glucose.
 12. The method of claim 10 wherein the non-fermentable carbon source is ethanol.